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EP 1 098 003 A2 

Description 

HELD OF THE INVENTION 

5 [0001 ] The present Invention relates to a method for identification or detection of slow growing mycobacteria having 
a large number of clinical cases as causative microorganisms of tuberculosis and atypical mycobacteria! disease (espe- 
cially, tubercle bacilli group bacteria), which utilizing a nucleotide sequence of a DNA coding for DNA gyrase p subunit 
(to be referred to as "gyrB gene" or "oyrB" hereinafter). The identification and detection methods of the present inven- 
tion are useful in various industrial fields, such as medical science, Immunology and veterinary science. 

10 

BACKGROUND ART 

[0002] A plurality of species belonging to slow growing mycobacteria are known as acid-fast bacterial species 
which cause tuberculosis and diseases analogous to tuberculosis in human. Among aii, Mycobacterium tuberculosis 
15 complex, Mycobacterium avium complex and Mycobacterium kansasii occupy the most part of clinical cases. Recently 
these bacteria are causing a serious problem for the prognosis of patients of acquired immunodeficiency syndrome 
(AIDS), because they induce systemic disseminated infection in AIDS patients. 

[0003] Conventionally, identification and detection of these bacterial species have been carried out by physiological 
and biochemical methods based on the culturing. For example, identification and detection have been carried out using 

20 such differences in color development because (1) since there are three groups in the slow growing mycobacteria, 
namely a group which develops yellow color only when it is cultured in the dark after irradiation with light (photochro- 
mogen), (2) a group which develops color even when cultured without irradiation of light (scotochromogen) and (3) a 
group which does not develop color even when light is Irradiated (achromogen). Known methods include identification 
and detection based on the ability of cultured bacteria to produce catalase or show urease activity, Tween hydrolyzing 

25 activity or nitrate reducing activity. 

[0004] Tuberculosis is a most important infectious disease, because ninety million people in the world newly con- 
tract the disease every year and thirty million people of them die every year, but its countermeasure is not sufficient. 
Tubercle bacilli as its causative agents are classified Into four strains, namely a tubercle strain (Mycobacterium 
tuberculosis) , a bovine type strain {Mycobacterium bovis) and an Africa strain (Mycobacterium africanum) which are 

30 pathogenic for human and a rat type strain (Mycobacterium microti) which ts not pathogenic for human. Conventionally, 
tests of acid-fast bacteria including these mycobacteria were mainly carried out by smear staining by the Ziehl-Neelsen 
method, isolation culturing method using Ogawa medium and a drug-sensitivity test. With the development of tech- 
niques thereafter, BACTEC 460 TB System, Septl-Check AFB, MGIT (Mycobacteria Growth indicator Tube) and the like 
novel culture techniques have been developed. 

35 [0005] However, these tests require pure culture, in addition, because phenotype to be compared is liable to 
change, the judgment often becomes subjective. As a result, not only a prolonged period of time is required but also 
accurate judgment of the species is extremely difficult In order to solve such problems, certain identification and detec- 
tion methods have recently been considered and put into use, e.g., a method for judging the presence of a specific 
nucleotide sequence of a gene utilizing the polymerase chain reaction (to be referred to as °PCR" hereinafter) or the 

40 like, sub-classification of mycobacteria using an insertion sequence ISfiUfl, and the like. The PCR method is suited for 
identifying and detecting slow growing mycobacteria from the viewpoint that quick and objective judgment Is possible 
without requiring culturing. 

[0006] In that case, the gene to be used is a rRNA gene in most cases. T. Rogall etal. (1990, J. Gen. Microbiol., 
136, 1915 - 1920) have proposed a method for the identification of mycobacteria species based on PCR using 16S 

45 rRNA sequences. However, these primers could not distinguish between Mycobacterium gastri and Mycobacterium 
kansasii, which show different phenotype characteristics. On the other hand, B. Boddinghaus et al. (1 990, J. Clin. 
Microbiol., 28: 1751 - 1759) have reported an oligonucleotide derived from 16S rRNA sequence, which is specific for 
human type mycobacterium group, avian type mycobacterlum-paramycobacterium and Mycobacterium intracellulars 
group. Even the use of this oligonucleotide could not give necessary resolution for carrying out identification at species 

so level. An identification method using these rRNA gene sequences is now on the market and available from Nippon 
Roche as a gene diagnosis kit under a trade name of "Amplicore Mycobacterium". In addition to this, detection or iden- 
tification methods using rRNA sequences have been disclosed by Toyobo (JP-A-1 0-3231 89; the term "JP-A" as used 
herein means an "unexamined published Japanese patent application") and Becton, Dickinson and CO. (JP-A-1 0- 
057098). In order to solve the aforementioned problem of not being able to distinguish two species, an identification or 

55 detection method using the sequence of a region between 1 6S rRNA and 23S rRNA has been proposed by A. Roth et 
al. (1998, J. Clin. Microbiol., 36: 139 - 147). However, since the region between 16S rRNA and 23S rRNA has only 
about 200 base pairs, it is difficult to carry out high accuracy molecular phylogenic analysis by such a short sequence, 
and, when a new strain having an intermediate sequence which does not coincide with any one of the sequences of two 
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V. 

strains is generated, It Is not able to judge Its closeness to which of them. 

[0007] On the other hand, It was shown that more minute and accurate classification and Identification of many bac- 
teria including those of the genus Pseudomonas and the genus Adnetobacter is possible by using a gene which 
encodes a protein having high evolution rate, particularly a 1,200 bp sequence of gyrB gene (Yamamoto, S. and S. 

5 Harayama, 1995, Appl. Environ. Microbiol., 61: 1104 - 1109, Yamamoto, S. and S. Harayama, 1996, Int. J. Syst. 
Bacterid., 46: 508 - 511, Harayama, S. and S. Yamamoto, 1996, pp. 250 • 258 In Molecular Biology of Pseudomonas, 
T. Nakazawa, K. Fukuda, D. Haas, S. Silver (eds), ASM Press, Washington, D.C., S. Yamamoto and S. Harayama, 
Kagaku-to-Seibutsu (Chemistry and Biology, Japan), 1996, vol. 34, no. 3, pp. 149 - 151, S. Yamamoto and S. 
Harayama, Nippon Nogei Kagaku Kalshi (Journal of Agricultural Chemistry, Japan), 1 997, vol. 71 , no. 9, pp. 894 - 897). 

10 [0008] Attempts have been made to carry out identification of slow growing mycobacteria using genes coding for 
proteins other than the gyrB gene. For example, C.T. Shlvannvar et al. have discussed on the phylogenic relationship 
among slow growing mycobacteria and their relationship to antigenicity using superoxide dismutase gene (1994, J. Clin, 
Microbiol., 32: 2801 - 2812), and D.S. Swanson era/, have attempted to carry out minute classification of avian type 
mycobacterium-paramycobacterlum and Mycobacterium intracellulars group using a 65 kD heat shock protein gene 

75 (1997, Int. J. Syst. Bacterid., 47: 414 - 419). In addition to the rRNA gene, Abbott Laboratories, USA, has disclosed a 
detection method which uses a gene coding for a protein antigen B of Mycobacterium tuberculosis, gene sequences of 
65 kD heat shock protein, 10-kD heat shock proteins and the like of Mycobacterium tuberculosis and sequences related 
to insertion sequences IS987 and IS6110, in J P-W-1 0-500567 (the term "JP-W" as used herein means an "Japanese 
national publication of a PCT application") (International Publication No. WO 95/31571). In addition, Becton, Dickinson 

20 and Co. has disclosed in JP-A-06-319560 a detection or identification probe derived from a gene which encodes a 70 
kD heat shock protein of Mycobacterium paratuberculosis. However, among these genes, only the gyrB gene shows 
no contradiction when molecular phylogenical data are compared with the Identification of species by the conventional 
taxonomic means (Yamamoto and Harayama, 1998, Int. J. Syst. Bacterid., 48: 813 - 819, Yamamoto etal., 199, Int. J. 
Syst Bacteriol., 49: 87 - 95, Suzuki etal., Int. J. Syst Bacterid., in press, Kasai era/., Int. J. Syst. Bacterid., in press). 

25 [0009] A patent application relating to a method for the identification or detection of bacteria using the gyrB gene 
has already been filed by the present applicant (JP-A-1 1-1 691 7). However, this document does not disclose identifica- 
tion and detection of slow growing mycobacteria and also does not teach or suggest which region of the gyrB gene can 
be used in carrying out Identification and detection of slow growing mycobacteria. 

[0010] Because the slow growing mycobacteria include the bacteria that cause tuberculosis and the like serious 
30 diseases, great concern has been directed toward the development of a method for accurately identifying and detecting 
this bacterial group. On the other hand, because the growth rate of slow growing mycobacteria is lower than that of gen- 
eral bacteria, it is difficult to identify or detect them by physiological and biochemical methods which essentially require 
cuituring of bacteria. 

[0011] The present Invention has been accomplished under such technical background to provide a method for the 
35 identification or detection of slow growing mycobacteria, especially tubercle bacilli group bacteria, utilizing the gyrB 
gene. 

[001 2] With the aim of solving the aforementioned problems, the present inventors have conducted extensive stud- 
ies and, as a result, found that at least a part of the nucleotide sequence of gyrB DNA is different among the slow grow- 
ing mycobacteria. 

40 [0013] The present inventors further determined gyrB gene sequences of standard strains of the slow growing 
mycobacteria. Taxonomic positioning of strains isolated from clinical cases was carried out based on these sequences. 
Then, the resulting taxonomic positioning was checked by the DNA-DNA hybridization method, which is a standard 
method for identifying species of bacteria. As a result, it was unexpectedly found that the taxonomic positioning deter- 
mined by using gyrB gene sequences shows good agreement with the result of the conventional classification method. 

45 [0014] In addition, nucleotide sequences of gyrB fragments were determined by the PCR method by amplifying 
them from DNA samples of standard strains of atypical mycobacteria, Mycobacterium gastri and Mycobacterium 
kansasil, which cannot be distinguished by the nucleotide sequence of 16S rRNA gene which is the most generally 
used gene sequence-aided detection method of bacteria. When the resulting sequences were compared, it was found 
that the 1 6S rRNA gene sequence was identical In both strains, but 66 positions in the 1 ,257 base gyrB gene nucleotide 

so sequence were different in both strains (Figs. 1-11). The present inventors further found that the taxonomically near 
bacteria belonging to the slow growing mycobacteria can be distinguished by designing primers based on such differ- 
ence in their sequences, which renders possible the PCR amplification specific for each of these strains. Thus, the 
present inventors found that it can determine accurate molecular phylogenic position of even a newly isolated strain and 
also can distinguish related species which cannot be distinguished by other genes, so that it is a method superior to 

55 methods by other genes. 

[0015] The present invention has been accomplished based on the above knowledge. 
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SUMMARY OF THE INVENTION 

[0016] Thus, the present invention relates to a method for identifying slow growing mycobacteria, especially tuber- 
cle bacilli group bacteria, which comprises carrying out identification of bacteria using gyrB DNA as a marker. Also, the 
5 present invention relates to a method for detecting slow growing mycobacteria, especially tubercle bacilli group bacte- 
ria, which comprises carrying out detection of bacteria using gyrB DNA as a marker. 

[0017] The present invention further relates to a method for identifying slow growing mycobacteria, which com- 
prises amplifying the regions corresponding to SEQUENCE NO. 7, 9, 1 1 , 13, 15, 17, 19, 21, 23, 25, 27, 29. 31, 33, 35, 
37 and 39 in the gyrB of slow growing mycobacteria, comparing nucleotide sequences of the amplified fragments with 
io the nucleotide sequences described in SEQUENCE NO. 7, 9, 1 1 , 1 3, 1 5, 1 7, 1 9, 21 , 23, 25, 27, 29, 31 , 33, 35, 37 and 
39, thereby calculating genetic distance from each sequence, and carrying out identification of the aforementioned slow 
growing mycobacteria based on the genetic distance. 

[001 8] Also the present invention relates to a method for detecting a specific bacterium belonging to the slow grow- 
ing mycobacteria using a specific sequence in the gyrB. In particular, the present invention relates to a method for 
is detecting Mycobacterium kansasii, which comprises detecting Mycobacterium kansasii using, as a primer or probe, an 
oligonucleotide that contains a sequence coding for a part of or the entire portion of the amino acid sequence described 
in SEQUENCE NO. 4, or its complementary sequence, and also substantially functions as a primer or probe, and to a 
Mycobacterium kansasii detection kit which comprises the just described oligonucleotide. 

[001 9] The present invention also relates to a method for detecting Mycobacterium gastri, which comprises detect- 
20 ing Mycobacterium gastri using, as a primer or probe, an oligonucleotide that contains a sequence coding for a part of 
or the entire portion of the amino acid sequence described in SEQUENCE NO. 6, or its complementary sequence, and 
also substantially functions as a primer or probe, and to a Mycobacterium gastri detection kit which comprises the just 
described oligonucleotide. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

[0020] - 

Figs. 1 through 1 1 show alignment of the nucleotide sequences of various slow growing mycobacteria. The sym- 
30 bols at the left side in the figure Indicate the organisms shown below. 

KPM 1 403 Mycobacterium simiae 

KPM1 201 Mycobacterium marinum 

KP M220 1 Mycobacterium gordonae 
35 ATCC25274 Mycobacterium asiaticum 

KPM2027 Mycobacterium scrofulaceum 

KPM2403 Mycobacterium szuigai 

KP M30 1 2 Mycobacterium avium 

Bovlnel 0 Mycobacterium paratubercufo 
40 KPM31 01 Mycobacterium intracellular 

KPM3401 Mycobacterium malmoense 

ATCC51 789 Mycobacterium branderi 

T801 Mycobacterium africanum 

T901 Mycobacterium microti 

45 T704 Mycobacterium bovis 

T021 Mycobacterium tuberculosis 

KPM3504 Mycobacterium gastri 

KPM 1001 . Mycobacterium kansasii 

so Fig. 1 2 shows a result of the identification using the primers based on SEQUENCE No. 1 , SEQUENCE NO. 3, and 
SEQUENCE No. 5. Panel A shows amplified results using Mycobacterium tensasff-specific primers (SEQUENCE 
NO. 1 and SEQUENCE NO. 3), and panel B using Mycobacterium gasfrf-spedfic primers (SEQUENCE NO. 1 and 
SEQUENCE NO. 5). Lanes 1 and 12 are molecular weight markers. Lanes 2: strain KPM 1001T, 3: strain KPM 
1004, 4: strain KPM 1007, 5: strain KPM KY256, 6: strain KPM KY761 , 7: strain KPM KY798, 8: strain KPM 1998- 

55 1,9: strain KPM 3504T, 1 0: strain KPM 3502 and 1 1 : strain KPM 3503. 

Fig. 13 shows a phylogenetlc tree of slow glowing mycobacteria prepared by the molecular phylogenic analysis. 
This figure shows an example in which the presence of new species of slow growing mycobacteria was shown by 
gyrB sequence analysis. By carrying out molecular phylogenetlc analysis and comparing the thus obtained gyrB 
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sequences with already known gyrB sequences, It was shown that a group of strains KPM 2212, 2014, 1988-5, 
2209 and 2013 are new species. 

Fig. 14 Is an electrophoresis photograph of products amplified by PCR using primers specific for bacteria which 
constitute the tubercle bacilli. 

5 Rg. 1 5 Is an electrophoresis photograph of products amplified by PCR using primers specific for each bacterium 

which constitutes the tubercle bacilli. 

Rg. 16 is an electrophoresis photograph of fragments prepared by digesting PCR products with restriction 
enzymes. 

w DETAILED DESCRIPTION OF THE INVENTION 

[0021] The following describes the present Invention in detail. 

[0022] As the first step for the detection or Identification of the slow growing mycobacteria (especially tubercle bacilli 
group bacteria), a sample for detection or identification is collected. Examples of the sample include a sample collected 

15 from organisms (human, animals, etc.) showing tuberculosis or tuberculosis-analogous symptoms as well as a strain 
isolated from the sample. Examples of tuberculosis or tuberculosis-analogous symptoms include pneumonia, 
empyema, cystitis, pyelonephritis, prostatitis, peritonitis, pericarditis, meningitis, encephalitis, etc. (Pocket guide to clin- 
ical microbiology 2nd edition, Oatrick R. Murray, ASM press). The collected sample may be cultured or the microorgan- 
ism in the sample may be Isolated and cultured for the use in the following steps. However, the present invention is 

20 advantageous in that the sample as it is can be used. 

[0023] Then, a sample or isolated microorganism is usually subjected to a treatment to destroy the cell to extract 
DNA from the cell. The method for this treatment is not particularly limited and includes physically destroying method, 
chemically destroying method, etc. 

[0024] The sequence of the DNA gyrase p subunlt in the sample is determined by using the conventional way. 

25 Examples of the method for determining the DNA sequence include the dideoxy terminator method (Molecular Cloning: 
a laboratory manual 2nd edition, J. Sambrook, E. F. Rritsch, T. Maniatis, CSH press). The sequence determined is com- 
pared with the sequences of the DNA gyrase (5 subunit of the slow growing mycobacteria (Sequence No. 7, 9, 1 1, 13, 
1 5, 1 7, 1 9, 21 , 23, 25, 27, 29, 31 , 33, 35, 37, 39) to determine whether the microorganism in the sample belongs to the 
one of these slow growing mycobacteria or Is a new species. For the determination, 85% to 100% homology with 

30 respect to the 1200 bp sequence of the DNA gyrase p subunlt means the same species, while the homology less than 
85% means a new species. As a surprising matter, the identification of the slow glowing mycobacteria based on the 
DNA gyrase p subunit well matches with the Identification of the slow glowing mycobacteria by the conventional 
method. Thus, the present invention makes it possible to identify the slow growing mycobacterial in an accurate way 
and also makes it possible to distinguish the reiated species, which are not possible by the conventional way. 

35 [0025] Without determining the whole sequence of the DNA gyrase p subunit, identification and detection of the 
slow growing mycobacteria according to the present invention can be carried out by utilizing one or more unique partial 
sequence in the DNA gyrase p subunit which is characteristic to one or more of the microorganisms belonging to the 
slow growing mycobacteria or related species thereof. Example of the unique sequence is a sequence having 0 or 
more, preferably at least 1 , more preferably at least 2, and most preferably at least 3 unique bases in the sequence hav- 

40 ing a length of 5-mer to 50-mer, preferably 10-mer to 40-mer, more preferably 15-mer to 30-mer. When the unique 
sequence does not have a unique base, at least one unique base should exist at the 3'-side or 5'- side nearest neighbor 
base to the unique sequence. The complementary sequence to the unique sequence can be also used. 
[0026] The unique base means a base which can be found in only one or only several related species among the 
slow growing mycobacteria. The unique base may be located at arbitrary position in the unique sequence. When the 

45 unique sequence is utilized as a primer for the PCR, a unique base located near the 3'-end is preferable for the 5-end 
primer and a unique base located near the 5-end is preferable for the 3'-end primer. For the method utilizing the gel 
electrophoresis described below, the unique sequence may be designed so that the 3'-side or 5'-side nearest neighbor 
base to the unique sequence is the unique base (i.e., the unique base Is not contained in the unique sequence). Even 
if a unique sequence which is unique to a certain one species among the slow growing mycobacteria is not found, Iden- 

50 tlficatlon or detection of the certain species is possible by using in combination two or more unique sequences which 
are respectively unique to several species among the slow growing mycobacteria. For example, four tubercle bacilli 
group bacteria can be identified or detected by using Sequence 41 shown In Rg. 1 . By using Sequence 55 In combina- 
tion of Sequence 41 , It is possible to Identify or detect Mycobacterium microti. Mycobacterium kansasii and Mycobac- 
terium gastri can be Identified or detected by using Sequence 1 shown in Rg. 1. By using Sequence 3 in combination 

55 of Sequence 1 , it is possible to identify or detect Mycobacterium kansasii. According to the present invention, a sample 
obtained from human or animals showing tuberculosis or tuberculosis-analogous symptoms is used. Accordingly, it is 
possible to avoid pseudo positive reaction even if there are microorganisms other than slow growing mycobacteria that 
have the same unique sequence in the DNA gyrase p subunit 
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[0027] Examples of the concrete methods for identifying or detecting the slow growing mycobacteria, which utilizes 
the unique sequence, a partial sequence in the unique sequence, or a sequence having a unique sequence, in the DNA 
gyrase p subunit include (1) DNA chip (DNA mlcroarray) (Gingeras et al., 1998, Genome Res. 8: 435-448; Troesch et 
al. 1999 J. Clin. Microbiol. 37: 49-55), (2) PCR using the same as primers (Kasai, H., Ezaki, T., Harayama, S. 2000. J. 
Clin. Microbiol. 38: 301-308), (3) hybridization using the same as a probe (de los Reyes et al. 1997. Appl. Environ. 
Microbiol. 63: 11007-1117), (4) cleavage by the restriction enzyme that recognizes the unique sequence (Kasal H., 
Ezaki, T., Harayama, S. 2000. J. Clin. Microbiol. 38: 301 -308), and the like. Examples of the method to confirm the result 
of these methods include a method to confirm the existence of the amplified or cleaved fragments by the ge! electro- 
phoresis, a method using DNA chip (DNA microarray), etc. The above-described methods can be carried out by the 
known way (cf. Molecular Cloning: a laboratory manual 2nd edition, J. Sambrook, E. F. Fritssh, T. Maniatis, CSH press; 
Current protocols of molecular biology edited by Ausubel et al. Wiley; PCR primer - A laboratory manual, edited by D'rf- 
fenbach & Dveksler. SCH press, all herein incorporated by reference) The identification method and the detection 
method according to the present invention are further described below. 



15 (1 ) Identification method 



20 



25 



30 



[0028] The method of the present invention for identifying slow growing mycobacteria is characterized in that the 
regions corresponding to SEQUENCE NOS. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37 and 39 in the 
gyrB of slow growing mycobacteria are amplified by PCR, nucleotide sequences of the amplified fragments are com- 
pared with the nucleotide sequences described in SEQUENCE NOS. 7, 9, 1 1 , 1 3, 15, 1 7, 1 9, 21 , 23, 25, 27, 29, 31 f 33, 
35, 37 and 39, thereby calculating genetic distance from each sequence, and then identification of the aforementioned 
slow growing mycobacteria Is carried out based on the genetic distance. 

[0029] The term "identification" as used herein means that taxonomic positions of bacteria are determined by a 
molecular phylogenic or the like means. 

[0030] Though not particularly limited, the primers represented by SEQUENCE NO. 59 and SEQUENCE NO. 60 
can be exemplified as the primers to be used in the amplification of the regions corresponding to SEQUENCE NOS. 7, 
9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37 and 39 in the gyrB. 

[0031] Relationship between the nucleotide sequences of SEQUENCE NOS. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 
27, 29, 31 , 33, 35, 37 and 39 and their corresponding amino acid sequences and names of original microorganisms is 
shown in the following table. 



TABLE 1 



35 



40 



45 



50 



55 



Nucleotide sequence 


Amino acid sequence 


Name of original microorganisms 


SEQUENCE NO. 7 


SEQUENCE NO. 8 


Mycobacterium simiae 


SEQUENCE NO. 9 


SEQUENCE NO. 10 


Mycobacterium bovis 


SEQUENCE NO. 11 


SEQUENCE NO. 12 


Mycobacterium szulgai 


SEQUENCE NO. 13 


SEQUENCE NO. 14 


Mycobacterium malmoense 


SEQUENCE NO. 15 


SEQUENCE No. 16 


Mycobacterium Intracellulars 


SEQUENCE NO. 17 


SEQUENCE NO. 18 


Mycobacterium avium 


SEQUENCE NO. 19 


SEQUENCE NO. 20 


Mycobacterium gordonae 


SEQUENCE NO. 21 


SEQUENCE NO. 22 


Mycobacterium africanum 


SEQUENCE NO. 23 


SEQUENCE NO. 24 


Mycobacterium tuberculosis 


SEQUENCE NO. 25 


SEQUENCE NO. 26 


Mycobacterium gastri 


SEQUENCE NO. 27 


SEQUENCE NO. 28 


Mycobacterium marinum 


SEQUENCE NO. 29 


SEQUENCE NO. 30 


Mycobacterium microti 


SEQUENCE NO. 31 


SEQUENCE NO. 32 


Mycobacterium asiaticum 


SEQUENCE NO. 33 


SEQUENCE NO. 34 


Mycobacterium scrofuiaceum 


SEQUENCE NO. 35 


SEQUENCE NO. 36 


Mycobacterium branderi 


SEQUENCE NO. 37 


SEQUENCE NO. 38 


Mycobacterium paratuberculosis 
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TABLE 1 (continued) 



Nucleotide sequence 


Amino acid sequence 


Name of original microorganisms 


SEQUENCE NO. 39 


SEQUENCE NO. 40 


Mycobacterium kansasii 



[0032] The genetic distance can be calculated in accordance, for example, with the method described by Felsen- 
stein in the Phylip program (Felsenstetn, J., 1993 PHYLIP (Phylogeny Inference Package) version 3.5c. Distributed by 
the author, Department of Genetics, University of Washington, Seattle, U.S.A.). 



10 (2) Specific detection 

[0033] The method of the present invention for detecting Mycobacterium kansasii is characterized by the use, as a 
primer or probe, of an oligonucleotide which contains a sequence coding for a part of or the entire portion of the amino 
acid sequence described in SEQUENCE NO. 4, or its complementary sequence, and functions substantially as a 
is primer or probe. Also, the Mycobacterium kansasii detection kit of the present invention is characterized in that it con- 
tains the Just described oligonucleotide. 

[0034] The method of the present invention for detecting Mycobacterium gastri is characterized by the use, as a 
primer or probe, of an oligonucleotide which contains a sequence coding for a part of or the entire portion of the amino 
acid sequence described in SEQUENCE NO. 6, or its complementary sequence, and also functions substantially as a 
20 primer or probe. Also, the Mycobacterium gastri detection kit of the present invention is characterized in that it contains 
the just described oligonucleotide. 

[0035] In this connection, the term "substantially functions as a primer or probe" means that the oligonucleotide has 
such a length that a specific annealing or hybridization can be effected, and its gist is to exclude the oligonucleotide 
which has a sequence that can anneal to or hybridize with the DNA to be detected but cannot be used in specific detec- 
ts tlon, because it frequently causes nonspecific annealing or hybridization due to its short length, in order to confirm that 
a certain oligonucleotide can substantially functions as a primer for PCR, the PCR is carried out at a 3°C higher anneal- 
ing temperature and a 3°C lower annealing temperature than the usually employed temperature forthe PCR. If the PCR 
product is observed only at 3°C lower annealing temperature, there is a possibility of false-positive. In such a case, the 
nucleotide sequence of the amplified fragment Is determined by the conventional way and compared with the known 
30 sequence to confirm whether the oligonucleotide used can substantially work as a primer. In order to confirm that a cer- 
tain oligonucleotide can substantially function as a primer for PCR, it is preferable to perform PCR by using DNA of 
already known strain (for example, type strain) as a template for positive and negative controls. 
[0036] Though not particularly limited, the oligonucleotide represented by SEQUENCE NO. 3 can be exemplified 
as an oligonucleotide which can be used in the detection of Mycobacterium kansasii, and the oligonucleotide repre- 
ss sented by SEQUENCE NO. 5 can be exemplified as an oligonucleotide which can be used in the detection of Mycobac- 
terium gastri. 

[0037] Preparation of DNA to be tested, preparation of primers and PCR using the same, and preparation of probes 
and hybridization using the same can be earned out in the usual way without requiring special techniques. 
[0038] Regarding the primers to be used In PCR, it is not always necessary that both of them can perform specific 
40 annealing, and one of them may perform nonspecific annealing. The primer represented by SEQUENCE NO. 1 can be 
cited as an example of such a primer which performs nonspecific annealing. 

[0039] The methods of the present invention for identifying and detecting the slow growing mycobacteria (espe- 
cially, tubercle bacilli group bacteria) are characterized In that gyrB DNA is used as a marker. Examples of the slow 
growing mycobacteria include those shown in Table 1 . Examples of the tubercle bacilli group bacteria include Mycobac- 
45 terium tuberculosis, Mycobacterium bovis, Mycobacterium africanum and Mycobacterium microti. 

[0040] The following four methods can be exemplified as the Identification and detection methods which use gyrB 
DNA as a marker. 

A) A method Which employs PCR 

50 

[0041] This method is carried out as follows. 

(1) An oligonucleotide which contains a region of gyrB DNA, a region that has different nucleotide sequence among 
tubercle bacilli group bacteria, is synthesized. Since the nucleotide sequence of gyrB DNA corresponding to each 
55 bacterium is already determined as shown In Figs. 1-11, the just described oligonucleotide can be synthesized 
based on these drawings. As the oligonucleotide, an oligonucleotide which encodes the amino acid sequence 
described in SEQUENCE NO. 46, SEQUENCE NO. 48, SEQUENCE NO. 50, SEQUENCE NO. 52, SEQUENCE 
NO. 54, SEQUENCE NO. 56 or SEQUENCE NO. 58 can be exemplified as a preferable oligonucleotide, and an oil- 
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gonucleotide represented by SEQUENCE NO. 45, SEQUENCE NO. 47, SEQUENCE NO. 49, SEQUENCE NO. 51 , 
SEQUENCE NO. 53, SEQUENCE NO. 55 or SEQUENCE NO. 57 can be exemplified as a particularly preferable 
oligonucleotide. 

(2) A solution which contains the oligonucleotide synthesized in the above step, dNTP, DNA polymerase and a bac- 
5 terial DNA to be used as a sample is prepared. Concentration of each component contained in the solution may be 

the same as that in the reaction solution used In general PCR. It is not necessary to purify the bacterial DNA to be 
used as a sample, and disrupted cells may be used as such for example. 

(3) The solution prepared in the above step is repeatedly heated under such conditions that PCR can be generated. 
The heating temperature, cycle and the like conditions are not particularly limited, with the proviso that they are 

io within such ranges that PCR can be effected, but, since the homology of gyrB DNA among tubercle bacilli group 
bacteria is high as shown In Rgs. 1-11, It Is desirable to set the temperature at the time of annealing to a fairly high 
level, illustratively, it is desirable to set at 68°C or more. When the synthesized oligonucleotide can be hybridized 
with the added bacterial DNA, PCR occurs by the repetition of heating and amplified product is formed thereby. On 
the other hand, when the synthesized oligonucleotide cannot be hybridized with the added bacterial DNA, PCR 

is does not occur and amplified product Is not formed. 

(4) Electrophoresis of the solution after the above treatment Is carried out When the amplified product is contained 
in the solution, its corresponding band is formed on the electrophoresis gel. In consequence, identification and 
detection of the bacterium of interest can be made based on the electrophoresis pattern. 

20 B) A method whiPfi uses restriction enzyme digestion fragments 

[0042] This method is carried out as follows. 

(1) An oligonucleotide which is identical to a part of gyrB DNA of tubercle bacilli group bacteria and a complemen- 
ts tary oligonucleotide of the aforementioned part of DNA are synthesized. Since the nucleotide sequence of gyrB 

DNA corresponding to each bacterium is already determined as shown in Rgs. 1-11, the bust described oligonu- 
cleotides can be synthesized based on these drawings. As preferred oligonucleotides, an oligonucleotide which 
encodes the amino acid sequence described in SEQUENCE NO. 42 and an oligonucleotide which encodes the 
amino acid sequence described in SEQUENCE NO. 44 can be exemplified, and the oligonucleotide represented by 
30 SEQUENCE NO. 41 and the oligonucleotide represented by SEQUENCE NO. 43 can be exemplified as particularly 
preferred oligonucleotide. 

(2) PCR is carried out using the two oligonucleotides synthesized in the above step as primers, and a bacterial DNA 
sample as a template. It is not necessary to purify the bacterial DNA to be used as a sample, and disrupted cells 
may be used as such for example. PCR can be carried out in the usual way. 

35 (3) The DNA fragment amplified in the above step is digested with restriction enzymes. The restriction enzymes to 
be used are not particularly limited, with the proviso that they can generate different fragments among correspond- 
ing bacteria which constitute tubercle bacilli. For example, Rsa I and Taq\ can be exemplified as such restriction 
enzymes. 

(4) Electrophoresis of the fragments digested in the above step Is carried out The digested DNA fragments appear 
40 at positions corresponding to their length. In consequence, identification and detection of the bacterium of interest 
can be made based on the electrophoresis pattern. 

[0043] The aforementioned two methods can be cited as typical examples of the identification or detection method 
of the present invention, but other methods are also included in the identification or detection method of the present 

45 invention, with the proviso that they use gyrB DNA as a marker. Examples of these other identification or detection 
methods include a method in which gyrB DNA is amplified by PCR, and identification or detection of the bacterium of 
interest is carried out by determining nucleotide sequence of the amplified fragment and a method In which an oligonu- 
cleotide which contains a region of gyrB DNA, a region that has different nucleotide sequence among tubercle bacilli 
group bacteria, is synthesized, and the bacterium of Interest Is Identified or detected by carrying out Southern blotting 

so using the oligonucleotide as a probe. 

Q) Method which employ? the gel electrophoresis 

[0044] According to this method, easy and qualitative analysis as well as a certain degree of quantitative analysts 
55 are possible for At tuberculosis, M. bovis, M. africanum, M. microti, M. kansasii, M. avium, M. intracellulars and for the 
multiple infection found in the patient having lowered immunological competence. 

[0045] Dideoxy nucleotides and primers that are unique to one or more of these species are used for this method. 
The oligonucleotide used as a primer is designed so that the 3'-side nearest neighbor base in DNA gyrase 0 subunit 
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sequence to the primer Is the unique base. The sequence of the primer Itself may be common In all of the slow growing 
mycobacteria or may be common in one or more slow growing mycobacteria. In the latter case, an oligonucleotide mix- 
ture is preferably used in order to assure the reaction. 

[0046] All of the 4 types of dideoxy nucleotides is labeled with a fluorescent substance or radioactive substance. By 
5 labeling each of the 4 dideoxy nucleotides with different types of substances, it is possible to obtain the necessary infor- 
mation from only one lane of the sequence gel. Labeling the 4 dideoxy nucleotides with the same fluorescent substance 
or radioactive substance requires to carry out electrophoresis using 4 different lanes. 

[0047] The reaction mixture used for this method is the same with the reaction mixture for the usual sequence reac- 
tion except that dATP, dTTP, dGTP, and dCTP are not contained. In other words, the reaction mixture contains, as 

io essential components, an appropriate buffer, a DNA polymerase, a labeled ddNTP, and the primer. A sample collected 
from the patient Is mixed with this reaction mixture and then subjected to the reaction at an appropriate temperature at 
which the reaction can occur (for example, at 95°C for 10 seconds, 50°C for 5 seconds, and 60°C for 4 minutes; 25 
cycles). Then, existence of the labeled primer Is checked, for example, by subjecting the reaction product to the gel 
electrophoresis by the conventional way, or the like. The pattern which appears on the gel differs depending on the 

15 length of the primer used and the type of the 3'-side nearest neighbor based to the primer. The location of the primer 
sequence is not particularly limited as long as the length of the primer sequence is the same. However, for the quanti- 
tative analysis, primers having a quite high ATm (about 5°C or more) is not preferable. Examples of the primers include 
the nucleotides represented by Sequence 61 (5'-gacgcstaygcgatatc-3') based on the M. tuberculosis complex and M. 
kansasii and Sequence 62 (S'-agcggytacaacgtcag) based on M. avium and M. intracellulare in Fig. 1. At the position 

20 corresponding to the 1 8-base length, a signal of T is found in the case of M. tuberculosis complex, "G" for M. kansasii, 
and "C" for M. intracellulare. Detection of a plural number of signals at the position corresponding to the 1 8-base length 
means the multiple infection. Moreover, approximate amount of existence of each species can be estimated from the 
signal intensity detected. The sequence that Is unique to 4 species of the tubercle bacilli group bacteria and the 
sequence which can distinguish M. gastri (a species which is near to M. kansasii but has only a few number of clinical 

25 cases of human infection) have been explained in the above-described methods. By combining this method which uses 
gel eiectophoresis with the above -described methods, further detailed identification or detection is possible. 

P) Method which employs the PNA chip 

30 [0048] The detection or identification of the slow growing mycobacteria is also possible by utilizing the DNA chip. 
Examples of the method which employs the DNA chip is described below. First, the region in the DNA gyrase 0 subunit 
in one or more standard strains of the slow growing mycobacteria is amplified, for example, by the PCR. Then, the 
amplified product is labeled by Cy5 or the Tike, and the synthesized DNA oligo probe is fixed on a plate such as slide 
grass. A DNA in a sample is obtained and subjected to a hybridization reaction on the plate having a solid phrased 

35 probe, which is then subjected to washing and detection in the conventional way. The size of the region in the DNA 
gyrase 0 subunit is preferably 250 bp or less, more preferably 180 bp or less, and still more preferably 125 bp or less. 
The oligo probe size is preferably from 14 to 17 men 

[0049] Known protocols for the method which employs the DNA chip can be employed for the above-described 
method (cf. Lemieux, B., Aharonl, A., and M. Schena (1998), Overview of DNA Chip Technology, Molecular Breeding, 

40 4, 277-289; Schena, M., Heller, R.A., Theriault, T.P., Konrad, K., Lachenmeier, E., and R.W. Davis (1998), Microarrays: 
biotechnology's discovery platform and functional genomics, Trends in Biotechnology, 16, 301-306; and Heller, R.A., 
Schema, M., Chai, A., Shalom, D., Bedilion, T, Gilmore, J., Woolley, D.E., and Davis, R.W. (1997), Discovery and anal- 
ysis of inflammatory disease-related genes using cDNA microarrays, Proceedings of the National Academy of Sciences 
USA, 94, 2150-2155), all herein incorporated by reference). 

45 [0050] Illustrative but non-limiting example of the protocol for the DNA chip method is described below. 

(1) Labeling 

[0051] PCR amplification product (125 bp) is obtained by 40-cycles treatment (each cycle: at 96°C for 1 minute, 
so 55°C for 30 seconds, and 72°C, 2 minutes). The product is then subjected to the ethanol precipitation. Then, 10 u,M 
Cy5-dCTP/100 jil is added instead of dCTP/100 uJ and the 40-cycles treatment was carried out again (each cycle: at 
96°C for 1 minute, 55°C for 30 seconds, and 72°C, 2 minutes). Then, the product is subjected to the ethanol precipita- 
tion. 

55 (2) Spotting 

[0052] Spotting is carried out under the following conditions. 
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Slide: Silylated Slides 

Spotting: Spotting by SPBIO (manufactured by HITACHI) using a 4 Pin head with a pitch of 1.0 mm. A 20 uJ sample 
(1 0 uJ of 200 u.M probe + 1 0 u.1 of x2 Spotting Solution (Arraylt™)) is used for each well of the plate (about 4 to 5 nl 
per 1 spot). 

5 Time: about 10 minutes/slide (96 spots) 

Oligo probe size: cone, probe 14 - 17 mer, final cone. 100 |xM 

(3) Hybridization 

io [0053] Hybridization is carried out using UnlHyb™ (Arraylt™). The labeled product Is dissolved in 4 u.1 of sterilized 
water and 1 6 uJ of x1 .25 UnlHyb™ was added. 

[0054] Then, 9.6 u.l of the resulting mixture was dropped onto a cover slip (24 x 32 mm; 1 .25 uJ/cm 2 ) and the cover 
slip was placed onto the microarray such that bubbles are not included between the cover slip and the microarray. Then, 
the microarray is incubated at 46°C for 4 hours. 

15 

(4) Washing 

[0055] Washing is carried out with 2 x SSC (+ 0.2% SDS) for 5 minutes at room temperature, with 0.1 x SSC (+ 
0.2% SDS) for 5 minutes at room temperature, and then with 0.1 x SSC (+ 0.2% SDS) for 5 minutes at room tempera- 
20 ture. The microarray is centrifuged and dried. At least 1 week storage at 4°C Is possible. 

(5) Scanning 

[0056] Scanning is carried out by using ScanArray 1000 (ScanArray Lite) manufactured by GSI LUMONICS. 

25 

Scanning software: ScanArray 
Analyzing software: QuantArray 

EXAMPLE- 1 

30 

[0057] Using the oligonucleotides represented by the nucleotide sequences described in SEQUENCE NO. 39 and 
SEQUENCE NO. 40, gyrB gene sequences of 8 acid-fast bacterial strains (KPM 2201T, KPM 2202, KPM 2203, KPM 
201 3, KPM 201 4, KPM 1 9BB-5, KPM 2209 and KPM 221 2) isolated from clinical cases were determined. Using the thus 
obtained gyrB sequences and a gyrB sequence set (SEQUENCE NO. 7 to SEQUENCE NO. 40) for slow growing 

35 mycobacteria identification use, their phylogenic relationship was estimated by a molecular phylogenic analysis. The 
molecular phylogenic analysis was carried out in the following manner using general-purpose molecular phylogenic 
analysis programs Clustal W (Thompson, J.D., D.G. Higgins and T.J. Gibson, 1 994, Clustal W: Improving the sensitivity 
of progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight 
matrix choice., Nucleic Adds Res., 22: 4673 - 4680) or Phylip (Felsenstein, J., 1993 PHYLIP (Phytogeny Inference 

40 Package) version 3.5c. Distributed by the author, Department of Genetics, University of Washington, Seattle, U.S.A.), 
in accordance with the instructions for use of both programs. A multiple alignment file is prepared by the Clustal W pro- 
gram from the gyrB gene sequences obtained using the oligonucleotides represented by the nucleotide sequences 
described in SEQUENCE NO. 59 and SEQUENCE NO. 60 and the slow growing mycobacteria identification gyrB 
sequence set of SEQUENCE NO. 7 to SEQUENCE NO. 40. An example of the parameters to be used in making the 

45 multiple alignment is "Gap Open Penalty: 15.00; Gap Extension Penalty: 6.66; DNA weight matrix: IUB; DNA transition 
weight 0.5". The thus obtained multiple alignment Is compared with a multiple alignment file obtained from amino acid 
sequences, and questionable points are corrected. Next, the genetic distance between respective sequences is calcu- 
lated based on the multiple alignment file. The dnadist program of Phylip is used for the calculation. The calculation is 
carried out in accordance with the Kimura 2-parameter model. A phylogenetic tree is prepared from the thus obtained 

so genetic distances by a neighboring sequences binding method. Correctness of the phylogenetic tree is checked by cal- 
culating the bootstrap probability. 

[0058] On the other hand, the aforementioned 8 strains were also identified by a 1 6S rRNA gene-aided method and 
a biochemical method. The above results are shown in Table 2. 

55 
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TABLE 2 



Strain name 


Biological test 


16S rRNA gene 


DNA homology test 


KPM 2201T 


M. gordonae 


M. gordonae 


M. gordonae 


KPM 2202 


M. gastri 


M. gordonae 


M. gordonae 


KPM 2203 


M. gastri 


M. gordonae 


M. gordonae 


KPM 2013 


M. scrofulaceum 


M. gordonae 


new species 


KPM 2014 


M. scrofulaceum 


M. gordonae 


new species 


KPM 1988-5 


M. scrofulaceum 


M. gordonae 


new species 


KPM 2209 


M. scrofulaceum 


M. gordonae 


new species 


KPM 2212 


no data 


M. gordonae 


new species 



[0059] As shown in the table, 3 of the above 8 strains, namely KPM 2201T, KPM 2202 and KPM 2203, were iden- 
tified as strains belonging to Mycobacterium gordonae, but the other 5 strains, KPM 2013, KPM 2014, KPM 1988-5, 

20 KPM 2209 and KPM 221 2, were suggested to be sibling species of Mycobacterium gordonae but different species (new 
species) (Fig. 13). When a DNA-DNA hybridization test (Ezaki, T., Hashimoto, Y., Takeuchi, T., Yamamoto, H, Shu-Lin 
Liu, Matsui, K. & Yabuuchi, E (1 988), J. din. Microbiol. , 26, 1 708 - 1 71 3; EzaW, T., Hashimoto, Y, Takeuchi & Yabuuchi, 
E (1 989), Int J. Syst. Bacterid., 39, 224 - 229) was carried out in order to inspect this result, it was supported that they 
are new species. This result suggests that the gyrB sequence set for slow growing mycobacteria identification use gives 

25 highly reliable results for not only known strains but also strains of new species. 

EXAMPLE 2 

[0060] Nucleotide sequences of the gyrB gene of Mycobacterium kansasii and Mycobacterium gastri (Figs. 1-11) 
30 were compared to prepare a primer which specifically anneals to the gyrB gene of Mycobacterium kansasii 
(SEQUENCE NO. 3) and a primer which specifically anneals to that of Mycobacterium gastri (SEQUENCE NO. 5). A 
primer which anneals to the gyrB gene of both strains (SEQUENCE NO. 1) was also prepared. 
[0061] Using these primers, PCR was carried out on disrupted cell suspensions of strains KPM 1004, KPM 1 007, 
KPM KY256, KPM KY761, KPM KY768, KPM 1998-1, KPM 3502 and KPM 3503 isolated from clinical cases. 
35 [0062] The PCR amplification conditions are as follows. 

At 95°C for 10 minutes; 1 cycle 

At 95°C for 1 minute and 68°C for 1 minute and 30 seconds; 30 cycles 
At 72°C for 1 0 minutes; 1 cycle 
40 Primer concentration; 1 u.M for each 
dNTP: 100uJv1foreach 

Ampli Taq GOLD™ and PCR buffer I attached thereto (Perkin Elmer, USA) were used. 

[0063] When the thus amplified DNA fragments were analyzed by an electrophoresis, amplified fragments were 
45 observed only by the combination of SEQUENCE NO. 1 and SEQUENCE NO. 3 in the case of KPM 1004, KPM 1007, 
KPM KY256, KPM KY761 , KPM KY768 and KPM 1 998-1 (Table 3), so that these strains were identified as Mycobacte- 
rium kansasii. In the case of KPM 3502 and KPM 3503, amplified fragments were observed only by the combination of 
SEQUENCE NO. 1 and SEQUENCE NO. 5 (Table 3), so that these strains were Identified as Mycobacterium gastri. 
The electrophoresis patterns used in the Judgment are as shown In Fig. 12. These Identification results coincided with 
so the Identification results of the DNA-DNA hybridization method. 



TABLE 3 





M. kansasii 


M. gastri 


SEQUENCE NO. 1 
SEQUENCE NO. 3 


amplification was possible 


No amplification 
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TABLE 3 (continued) 





M. kansasii 


M. gastri 


SEQUENCE NO. 1 
SEQUENCE NO. 5 


No amplification 


amplification was possible 



EXAMPLE 3 

io [0064] A 10 ng portion of purified DNA was prepared from each of 9 bacterial species Including 4 tubercle bacilli 
group bacterial species and 5 other bacterial species belonging the genus Mycobacterium. PCR was carried out using 
these DNA samples as templates, and the oligonucleotides described in SEQUENCE NO. 41 and.SEQUENCE NO. 43 
as primers. The PCR amplification conditions are as follows. 

is At 95°C for 1 0 minutes; 1 cycle 

At 95°C for 1 minute, 68°C for 1 minute and 72°C for 1 minute; 30 cycles 
At 72°C for 10 minutes; 1 cycle 
Primer concentration; 1 u.M for each 
dNTP: 100 \iM for each 

20 Ampll Taq GOLD™ and PCR buffer 1 attached thereto (Perkin Elmer, USA) were used. 

[0065] The products amplified by PCR were analyzed by an agarose gel electrophoresis. The results are shown in 
Rg. 14. In this connection, the relationship between lanes and bacterial species is as follows. 

25 Lane 1 : Mycobacterium tuberculosis 

Lane 2: Mycobacterium bovis 

Lane 3: Mycobacterium africanum 

Lane 4: Mycobacterium microti 

Lane 5: Mycobacterium kansasii 

30 Lane 6: Mycobacterium gastri 

Lane 7: Mycobacterium abscessus 

Lane 8: Mycobacterium chetonae 

Lane 9: Mycobacterium trviale 

35 EXAMPLE 4 

[0066] A 1 0 ng portion of purified DNA was prepared from each of 4 tubercle bacilli group bacterial species. PCR 
was carried out using these DNA samples as templates, and the oligonucleotides represented by SEQUENCE NO. 45, 
SEQUENCE NO. 47, SEQUENCE NO. 49, SEQUENCE NO. 51, SEQUENCE NO. 53, SEQUENCE NO. 55 and 
40 SEQUENCE NO. 57 as primers. The PCR amplification conditions are as described in Example 3. The products ampli- 
fied by PCR were analyzed by an agarose gel electrophoresis. The results are shown in Rg. 15. In this case, the rela- 
tionship between lanes and bacterial species is as follows. 

Lane 1 : Mycobacterium tuberculosis 
45 Lane 2: Mycobacterium bovis 
Lane 3: Mycobacterium africanum 
Lane 4: Mycobacterium microti 

[0067] As shown in Rg. 15, the amplified product was observed only by Mycobacterium tuberculosis when the oil- 
so gonucieotides represented by SEQUENCE NO. 45 and SEQUENCE NO. 47 were used as primers, the amplified prod- 
uct was observed only by Mycobacterium bovis when the oligonucleotides represented by SEQUENCE NO. 49 and 
SEQUENCE NO. 51 were used as primers, the amplified product was observed by Mycobacterium africanum and 
Mycobacterium microti when the oligonucleotides represented by SEQUENCE NO. 45 and SEQUENCE NO. 53 were 
used as primers and the amplified product was observed by only Mycobacterium microti when the oligonucleotides rep- 
55 resented by SEQUENCE NO. 55 and SEQUENCE NO. 57 were used as primers. Based on the above results, relation- 
ship between primers and bacterial species can be summarized as follows. 
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TABLE 4 



SEQUENCE No. 


Mycobacterium tuber- 
culosis 


Mycobacterium bovis 


Mycobacterium afri- 
canum 


Mycobacterium microti 


SEQ. NO. 41 
SEQ. NO. 43 


Amplification possible 


Amplification possible 


Amplification possible 


Amplification possiuie 


SEQ. NO. 45 
SEQ. NO. 47 


Amplification possible 


No amplification 


No amplification 


No amplification 


SEQ. NO. 49 
SEQ. NO. 51 


No amplification 


Amplification possible 


No amplification 


No amplification 


SEQ. NO. 45 
SEQ. NO. 53 


No amplification 


No amplification 


Amplification possible 


Amplification possible 


SEQ. NO. 55 
SEQ. NO. 57 


No amplification 


no amplification 


no amplification 


Amplification possible 



EXAMPLES 

[0068] A 10 ng portion of purified DNA was prepared from each of 4 tubercle bacilli group bacterial species. PGR 
25 was carried out using these DNA samples as templates, and the oligonucleotides represented by SEQUENCE NO. 41 
and SEQUENCE NO. 43 as primers. The PCR amplification conditions are as described in Example 3, The products 
amplified by PCR were digested with restriction enzymes Rsa I and Taq 1 , and the thus formed DNA fragments were 
analyzed by an agarose gel electrophoresis. The results are shown In Fig. 16. in this connection, the relationship 
between lanes and bacterial species Is as follows. 

30 

Lane 1: Mycobacterium tuberculosis 
Lane 2: Mycobacterium bovis 
Lane 3: Mycobacterium africanum 
Lane 4: Mycobacterium microti 

35 

EXAMPLE E 

[0069] Using the oligonucleotides represented by SEQUENCE NO. 1, SEQUENCE NO. 43, SEQUENCE NO. 45, 
SEQUENCE NO. 47, SEQUENCE NO. 49, SEQUENCE NO. 51, SEQUENCE NO. 53, SEQUENCE NO. 55 and 
40 SEQUENCE NO. 57 as primers, PCR was carried out on a solution of disrupted cells of a strain KPM KY631 isolated 
from a clinical patient of tuberculosis. When the product amplified by PCR was analyzed by an agarose gel electro- 
phoresis, the amplified product was observed only by the combination of SEQUENCE NO. 1 and SEQUENCE NO. 43 
and of SEQUENCE NO. 45 and SEQUENCE NO. 47, so that the strain KPM KY631 was Identified as the tubercle 
Mycobacterium tuberculosis (Table 4 and Fig. 15). 

45 

EXAMPLE 7 

[0070] Using the oligonucleotides represented by SEQUENCE NO. 41 and SEQUENCE NO. 43 as primers, PCR 
was carried out on a solution of disrupted cells of a strain KPM KY590 Isolated from a clinical patient of tuberculosis. 
so When nucleotide sequence of the thus amplified DNA fragment was determined, the thus obtained nucleotide 
sequence coincided with the nucleotide sequence of the tubercle Mycobacterium tuberculosis, so that the strain KPM 
KY590 was identified as the tubercle Mycobacterium tuberculosis (Figs. 1-11). 

EXAMPLE B 

55 

[0071] Using the oligonucleotides represented by SEQUENCE NO. 41 and SEQUENCE NO. 43 as primers, PCR 
was carried out on a solution of disrupted ceils of a strain isolated from a bovine patient of tuberculosis. When the prod- 
uct amplified by PCR was digested with restriction enzymes Rsa I and Taq I and the thus formed DNA fragments were 
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analyzed by an agarose gel electrophoresis, the result coincided with the pattern obtained from Mycobacterium bovis, 
so that this strain was identified as Mycobacterium bovis (Fig. 1 6). 

[0072] The present invention realizes accurate classification and identification of slow growing mycobacteria which 
are difficult to identify by conventional methods. It also renders possible quick identification of certain species of atypical 
5 mycobacteria, such as Mycobacterium kansasii and Mycobacterium gastri, which are difficult to distinguish by the iden- 
tification method based on 16S rRNA gene sequence. The present invention is useful in the fields of medicai science, 
immunology, veterinary science, etc. 
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SEQUENCE LISTING 



<110> MARINE BIOTECHNOLOGY INSTITUTE CO., LTD. 

<120> IDENTIFICATION METHOD AND SPECIFIC DETECTION METHOD OF 
SLOW GROWING MYCOBACTERIA UTILIZING DNA 6YRASE GENE 

<130> K4C-120036C/KI 

<140> 
<141> 

<150> JP 11-312525 
<151> 1999-11-02 

<160> 62 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 1 

ggtgtctcgg tggtcaacgc 

<210> 2 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Aaino acid 
sequence corresponding to primer or probe 

<400> 2 

Gly Val Ser Val Val Asn 

<210> 3 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DMA 

<400> 3 

gaccttgtgc ggggcggcgg 

<210> 4 
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<213> Artificial Sequence 
<220> 

5 <223> Description of Artificial Sequence: Amino acid 

sequence corresponding to primer or probe 

<400> 4 

Ala Ala Pro His Lys Val 

10 <210> 5 

<2T1> 20 
<212> DMA 

<213> Artificial Sequence 

rs <220> 

<223> Description of Artificial Sequence: Synthetic DNA 

<400> 5 

caccttgtgg ggggcggtga 20 

20 <210> 6 

<211> 6 

<212> PRT 

<213> Artificial Sequence 



25 



30 



<220> 

<223> Description of Artificial Sequence: Amino acid 
sequence corresponding to primer or probe 

<400> 6 

Thr Ala Pro His Lys Val 

<210> 7 
<211> 1263 
<212> DMA 

<213> Mycobacterium simiae 

35 <220> 

<221> CDS 

<222> (1)..(1263) 

<400> 1 

40 999 gag aac agt ggc tac acc gtc age ggc ggg ttg cac ggg gtc gga 48 

fily Glu Asn Ser Sly Tyr Thr Val Ser Sly Sly Leu His Sly Val Sly 



45 



50 



55 



1 5 10 15 

gtg teg gtg gtc aac gec ctg tec acc cgc ctg gaa gtc aac gtc aag 

Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu 61 u Val Asn Val Lys 
20 25 30 



96 



cgt gac ggc tat gag tgg ttc cag tac tac gac egg gcg gtg ccc ggc 144 

Arg Asp Gly Tyr Glu Trp Phe Sin Tyr Tyr Asp Arg Ala Val Pro Gly 
35 40 45 

sec etc aag caa ggc gag gcg acc aag aag acc ggc acc acg ate egg 192 

Thr Leu Lys Gin Gly Glu Ala Thr Lys Lys Thr Gly Thr Thr He Arg 
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50 5S 60 



ttc tgg gee gat cct gag ate ttc gaa acc acc cag tac gac ttc gag 
Phe Trp Ala Asp Pro Glu II© Phe Glu Thr Thr Gin Tyr Asp Phe 61u 
65 70 75 80 



acg gtg gcg cgc egg ttg cag gaa atg gcg ttc etc aac aag ggc ctg 
Thr Val Ala Arg Ara Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu 
85 90 95 



240 



c ctg 288 



acc ate aac etc acc gac gaa cgt gtc gag cag gac gag gtg gtc gat 336 
Thr He Asn Leu Thr Asp Glu Arg Val Glu G1n Asp Glu Val Val Asp 
100 105 110 

is gag gtg gtt age gac acc gcc gag gcg ccg aag tea gcc gag aag cag 384 

Glu Val Val Ser Asp Thr Ala Glu Ala Pro Lys Ser Ala Glu Glu Gin 
115 120 125 

gcg gcc gaa teg gcc aag ccg cac aag gtc aag cac cgc acg ttc cac 432 
Ala Ala Glu Ser Ala Lys Pro His Lys Val Lys His Arg Thr Phe His 
20 130 135 140 

tac ccg ggt ggg ttg gtg gat ttc gtc aag cac ate aat cgc acc aaa 480 
Tyr Pro Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys 
145 150 155 160 

aac ccg ate cag cag age gtc ate gac ttc gac ggc aaa gga acc ggg 528 
Asn Pro He Gin Gin Ser Val He Asp Phe Asp Gly Lys Gly Thr Gly 
165 170 175 

cac gaa gtc gag ate gcg atg cag tgg aac ggt ggt tat teg gag teg 576 
30 His Glu Val Glu He Ala Het Gin Trp Asn Gly Gly Tyr Ser Glu Ser 

180 185 190 

gtg cac acc ttc gcc aac acc ate aac acc cat gag ggc ggc acc cac 624. 
Val His Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His 
195 200 205 

35 

gag gag ggc ttc cgc age gcg ctg acc teg gtg gtg aac aag tac gcc 672 
Glu Glu Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala 
210 215 220 

40 aaa gac aag aag ctg etc aag gac aag gat ccc aac etc acc ggc gac 720 

Lys Asp Lys Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp 
225 230 235 240 

gac ate cga gaa ggg ctg gee gcg gtg ate tec gtg aag gtc gcc gag 768 
Asp He Arg Glu Gly Leu Ala Ala Val lie Ser Val Lys Val Ala Glu 
245 250 255 



ccg cag ttc gag ggc cag act aag acg aaa etc ggc aac ace gag gtc 816 

Pro Gin Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val 
260 265 270 

aag teg ttt gtc cag aaa gtc tgt aac gaa caa etc act cac tgg ttc 864 

Lys Ser Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe 
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275 280 285 

gag gcg aac ccg teg gaa" get aaa acc gtt gta aac aag gcg gtt teg 912 

Glu Ala Asn Pro Ser Glu Ala Lys Thr Val Val Asn Lys Ala Val Ser 
290 295 300 

teg gec cag gec cge ait gcg gcg cgt aag gcg egg gag ttg gta egg 960 

Ser Ala Sin Ala Arg lie Ala Ala Arg Lys Ala Arg Blu Leu Val Arg 

305 310 315 320 



cgt aag agt get acg sat ttg ggt ggg ttg ccg ggc aag ttg get gat 1008 
Arg Lys Ser Ala Thr Asp Leu Sly Gly Leu Pro Sly Lys Leu Ala Asp 
325 330 335 

1055 



tgc cgc teg acg gat ccg egg aag tct gag ctg tat gtg gtg. gaa ggt 
Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val val Glu Gly 
340 345 350 



gat tec gcg ggt ggg teg gcg aaa agt ggg cgt sat teg atg ttc cag 1104 
Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin 
20 355 360 365. 



gcg ate ttg ccg ctg cgc ggc aag ate ate aac gtc gaa aag gee cgc 1152 
Ala lie Leu Pro Leu Arg Gly Lys lie He Asn Val Glu Lys Ala Arg 
370 375 380 

ate gat egg gtg ctg aaa aac ace gaa gtc cag gec ate ate acc gcg 1200 
He Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He lie Thr Ala 
385 390 395 400 



ctg ggc aec ggc ate cac gac gaa ttc gac ate acc aaa ctg cgt tac 1248 
Leu Gly Thr Gly lie His Asp Glu Phe Asp He Thr Lys Leu Arg Tyr 
405 410 415 



cac aag ate gtg ttg 1263 
His Lys He Val Leu 
420 



<210> 8 

<211> 421 

<212> PRT 

40 <213> Mycobacterium sinriae 



<40O> 8 

Gly Glu Asn Ser Gly Tyr Thr Val Ser Gly Gly Leu His Gly Val Gly 
15 10 15 



Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Asn Val Lys 
20 25 • 30 . 

Arg Asp Gly Tyr Glu Trp Phe Gin Tyr Tyr Asp Arg Ala Val Pro Gly 
35 40 45 

Thr Leu Lys Gin Gly Glu Ala Thr Lys Lys Thr Gly Thr Thr He Arg 
50 55 60 
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Phe Trp Ala Asp Pro Glu He Phe 

65 "70-:- 



Glu 



Thr Thr 
75 



Gin Tyr Asp 



Phe Glu 
80 



Thr Val Ala Arg Arg Leu Gin Glu Met Ala Phe leu Asn Lys Gly Leu 
85 90 95 

Thr He Asn Leu Thr Asp Glu Arg Val 61u Gin Asp Glu Val Va1 Asp 
100 105 110 

Glu Val Val Ser Asp Thr Ala Glu Ala Pro Lys Ser Ala Glu Glu Gin 
115 120 125 

Ala Ala Glu Ser Ala Lys Pro His Lys Val Lys His Arg Thr Phe His 
130 135 140 

Tyr Pro Bly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys 
145 150 155 160 

Asn Pro He Gin Gin Ser Val lie Asp Phe Asp Gly Lys Gly Thr Gly 
165 170 175 

His Glu Val Glu He Ala Het Gin Trp Asn Gly Gly Tyr Ser Glu Ser 
180 185 190 

Val His Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His 
195 . 200 205 

Glu Glu Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala 
210 215 220 

Lys Asp Lys Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp 
225 230 235 240 

Asp He Arg Glu Gly Leu Ala Ala Val He Ser Val Lys Val Ala Glu 
245 250 255 

Pro Gin Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val 
260 265 270 

Lys Ser Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe 
275 280 285 

Glu Ala Asn Pro Ser Glu Ala Lys Thr Val Val Asn Lys Ala Val Ser 
290 295 300 

Ser Ala Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg 
305 310 315 320 

Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp 
325 330 335 

Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly 
340 345 350 

Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin 
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355 360 365 

Ala He Leu Pro Leu 'Arg Gly Lys lie lie Asn Val 61u Lys Ala Arg 
370 375 380 

He Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala 
385 390 395 400 

Leu Gly Thr Gly He His Asp Glu Phe Asp lie Thr Lys Leu Arg Tyr 
405 410 415 

His Lys He Val Leu 
420 

<210> 9 
<211> 1257 
<212> ONA 

.<213> Mycobacterium bovis 

20 <220> 

<221> CDS 

<222> (1)..(1257) 

<400> 9 

& teg sac gcg tat gcg ata tct ggt ggt ctg cac ggc gtc ggc gtg teg 48 

Ser Asp Ala Tyr Ala He Ser Gly Gly Leu His Gly Val Gly Val Ser 
1 5 10 15 

gtg gtt aac gcg eta tec acc egg etc gaa gtc gag ate aag cgc gac 96 
Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Glu He Lys Arg Asp 
a> 20 25 30 

ggg tac gag tgg tct cag gtt tat gag aag teg gaa ccc ctg ggc etc 144 
Gly Tyr Glu Trp Ser Gin Val Tyr Glu Lys Ser Glu Pro Leu Gly Leu 
35 40 45 



aag caa ggg gcg ccg acc aag aag acg ggg tea acg gta egg ttc tgg 192 
Lys Gin Gly Ala Pro Thr Lys Lys Thr Gly Ser Thr Val Arg Phe Trp 
50 55 60 



gec gac ccc get gtt ttc gaa acc acg gaa tac gac ttc gaa acc gtc 
Ala Asp Pro Ala Val Phe Glu Thr Thr Glu Tyr Asp Phe Glu Thr Val 
65 70 75 80 



240 



gec cgc egg ctg caa gag atg gcg ttc etc aac aag ggg ctg acc ate 288 
Ala Arg Arg Leu Gin Glu Net Ala Phe Leu Asn Lys Gly Leu Thr He 
85 90 95 



aac ctg acc gac gag agg gtg acc caa gac gag 'gtc gtc gac gaa gtg 336 
Asn Leu Thr Asp Glu Arg Val Thr Gin Asp Glu Val Val Asp Glu Val 
100 105 110 



gtc age gac 9tc gec gag gcg ccg aag teg gca agt gaa cgc gca gee 384 
Val Ser Asp Val Ala Glu Ala Pro Lys Ser Ala Ser Glu Arg Ala Ala 
115 120 125 
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10 



is 



20 



30 



35 



40 



45 



50 



gaa tec act gca ccg cae aaa gtt aag 
Glu Ser Thr Ala Pro His Lys Val Lys 



130 



135 



age cgc acc ttt 
Ser Arg Thr Phe 
140 



ggt ggc ctg gtg gac ttc gtg 
Gly 61 y Leu Val Asp Phe Val Lys His 



145 



150 



aaa cac ate aac cgc acc 
lie Asn Arg Thr 
155 



att cat age age ate gtg gac ttt tec 
He His Ser Ser He Val Asp Phe Ser 
165 

gtg gag ate gcg atg caa tgg aac gec 
Val Slu He Ala Met Gin Trp Asn Ala 
180 185 

acc ttc gee aac acc ate aac acc cac 
Thr Phe Ala Asn Thr. He Asn Thr His 
195 200 

ggc ttc cgc age gcg ctg acg teg gtg 
Gly Phe Arg Ser Ala Leu Thr Ser Val 



ggc aag ggc acc 
Gly Lys Gly Thr 
170 

ggg tat teg gag 
Gly Tyr Ser Glu 



R! 16 ?? 



ic acc 
y Thr 
205 



cac tat ccg 
His Tyr Pro 



aag aac gcg 
Lys Asn Ala 
160 

ggg cac gag 
Gly His Slu 
175 

teg gtg cac 
Ser Val His 
190 

cac gaa gag 
His Slu Slu 



210 



215 



cgc aag eta ctg aag gac aag gac ccc 
Arg Lys Leu Leu Lys Asp Lys Asp Pro 
225 230 



egg 
Arg 



aa ggc ctg gee get gtg ate teg 
lu Sly Leu Ala Ala Val He Ser 
245 



ttc gag ggc cag acc aag acc aag ttg 
Phe Glu Gly Gin Thr Lys Thr Lys Leu 
260 265 

ttt gtg cag aag gtc tgt aat gaa cag 
Phe Val Gin Lys Val Cys Asn Glu Gin 



gtg aac aag tac 
Val Asn Lys Tyr 
220 

aac etc acc ggt 
Asn Leu Thr Gly 
235 

gtg aag gtc age 
Val Lys Val Ser 
250 

ggc aac acc gag 
Gly Asn Thr Slu 



275 



280 



aac ccc acc gac teg aaa gtc gtt gtg 
Asn Pro Thr Asp Ser Lys Val Val Val 
290 295 



ctg acc cac tgg 
Leu Thr His Trp 
285 

aac aag get gtg 
Asn Lys Ala Val 
300 



gee aag gac 
Ala Lys Asp 



gac gat ate 
Asp Asp He 
240 

gaa ccg cag 
Slu Pro Sin 
255 

gtc aaa teg 
Val Lys Ser 
270 

ttt gaa gee 
Phe Glu Ala 



tec teg gcg 
Ser Ser Ala 



caa gee cgt ate gcg gca cgt aag gca cga gag ttg gtg egg cgt aag 
Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg Arg Lys 
305 310 315 320 

age gee acc gac ate ggt gga ttg ccc ggc aag ctg gee gat tgc cgt. 
Ser Ala Thr Asp He Gly Gly Leu Pro Gly Lys Leu Ala Asp Cys Arg 



325 



330 



335 



tec acg gat ccg cgc aag tec gaa ctg tat gtc gta gaa 
Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu <5Ty Asp Ser 
340 345 350 



ggt gac teg 



432 

480 

528 

576 

624 

672 

720 

768 

816 

864 

912 

960 

1008 

1056 
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gee ggc ggt tct gca aaa age ggt cgc gat teg atg ttc cag gcg ata 1104 
Ala Gly Sly Ser Ala Lys Ser Gly Arg Asp Ser Het. Phe Gin Ala lie 
355 360 365 

ctt ceg ctg cgc ggc aag ate ate aat gtg gag aaa gcg cge ate gac 1152 
Leu Pro Leu Arg Gly Lys He He Asn VaT Glu Lys Ala Arg He Asp 
370 375 380 

egg gtg eta aag aac ace gaa gtt cag gcg ate ate acg gcg ctg ggc 1200 
Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala Leu Gly 
385 390 395 400 

ace ggg ate cac gac gag ttc gat ate ggc aag ctg cgc tae cac aag 1248 
.Thr Gly He His Asp Glu Phe Asp He Gly Lys Leu Arg Tyr His Lys 
405 410 415 

ate gtg ctg 1257 
He Val Leu 

<210> 10 
<211> 419 
<212> PRT 

<213> Mycobacterium bo vis 
<400> 10 

Ser Asp Ala Tyr Ala He Ser 61y Gly Leu His Gly Val Gly Val Ser 
1 5 10 15 

Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Glu He Lys Arg Asp 
20 25 30 

Gly Tyr Glu Trp Ser Gin Val Tyr Glu Lys Ser Glu Pro Leu Gly Leu 
35 40 45 

Lys Gin Gly Ala Pro Thr Lys Lys Thr Gly Ser Thr Val Arg Phe Trp 
50 55 60 

Ala Asp Pro Ala Val Phe Glu Thr Thr Glu Tyr Asp Phe Glu Thr Val 
65 70 75 80 

■K Ala Arg Arg Leu Gin Glu Het Ala Phe Leu Asn Lys Gly Leu Thr lie 

85 90 95 

Asn Leu Thr Asp Glu Arg Val Thr Gin Asp Glu Val Val Asp Glu Val 
100 105 110 

45 Val Ser Asp Val Ala Glu Ala Pro Lys Ser Ala Ser Glu Arg Ala Ala 

115 120 125 • 

Glu Ser Thr Ala Pro His Lys Val Lys Ser Arg Thr Phe His Tyr Pro 
130 135 140 



30 



35 



SO 



55 



Gly Gly Leu Val Asp Phe Val Lys His lie Asn Arg Thr Lys Asn Ala 
145 150 155 160 
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10 



is 



20 



25 



30 



35 



40 



45 



He His Ser Ser He Val Asp Phe Ser Sly Lys Gly Thr Gly His Glu 
165 " ' 170 . 175 

Val Glu He Ala Met Gin Trp Asn Ala Gly Tyr Ser Glu Ser Val His 
180 185 190 

Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His Glu Glu 
195 200 205 

Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala Lys Asp 
210 215 220 

Arg Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp Asp He 
225 230 235 240 

Arg Glu Gly Leu Ala Ala Val He Ser Val Lys Val Ser Glu Pro Bin 
245 250 255 

Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val Lys Ser 
260 265 270 

Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe Glu Ala 
275 280 285 

Asn Pro Thr Asp Ser Lys Val Val Val Asn Lys Ala Val Ssr Ser Ala 
290 295 300 

Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg Arg Lys 
305 310 315 320 

Ser Ala Thr Asp He Gly Gly Leu Pro Gly Lys Leu Ala Asp Cys Arg 
325 330 335 

Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly Asp Ser 
340 345 350 

Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin Ala lie 
355 360 365 

Leu Pro Leu Arg Gly Lys He He Asn Val Glu Lys Ala Arg lie Asp 
370 375 380 



Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala Leu Biy 
385 390 395 400 

Thr Gly He His Asp G1u Phe Asp He Gly Lys Leu Arg Tyr His Lys 
405 410 415 

•He Val Leu 



so 



<210> 11 
<211> 1263 
<212> DNA 
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20 



25 
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40 



<213> Mycobacterium szulgai 

<220> * 

<221> CDS 

<222> (O..0263) 

<400> 11 



ggc gag aac agt ggc tac aac gtc agt ggt ggt ctg cac ggc gtc ggg 
Gly Glu Asn Ser Gly Tyr Asn Val Ser Gly Gly Leu His Gly Val Gly 



10 15 

gtg teg gtg gtg aac gcg ctg teg acc egg etc gag gtc gac ate aag 96 
Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Asp He Lys 
20 25 30 



age ccg ate cag cag agt gtc gtc gec ttc gac 99c aag agt gaa ggg 
Ser Pro He Gin Gin Ser Val Val Ala Phe Asp Gly Lys Gly Glu Gly 
165 170 175 

cac gag gtc gag ate gcg atg cag tgg aac ggc ggc tat teg gag teg 
His Glu Val Glu He Ala Het Gin Trp Asn Gly Gly Tyr Ser 61u Ser 



48 



ic 144 

y 



cgt gac ggc cac aag tgg teg cag ttc tac aac aag gec gtg ccg gg 

Arg Asp Gly His Lys Trp .Ser Gin Phe Tyr Asn Lys Ala Val Pro Gl 

35 40 45 

aeg etc aaa cag ggt gaa gee act aag aaa acc gga acg aea att agg 

Thr Leu Lys Gin Gly Glu Ala Thr Lys Lys Thr Gly Thr Thr lie Arg 

50 55 60 

ttc tgg gec gac ccg gac ate ttc gag acc ace gaa tac gac ttc gag 

Phe Trp Ala Asp Pro Asp lie Phe Glu Thr Thr Glu Tyr Asp Phe Glu 

65 70 75 80 

acc gtg gca cgc egg ctg cag gaa atg gca ttc ctg aac aag ggc ttg 288 

Thr Val Ala Arg Arg Leu Gin Glu Het Ala Phe Leu Asn Lys Gly Leu 

85 90 95 



192 
240 



30 acc ate aac etc acc gac gag cga gtt gec cag gac gag gtt gte gac 336 

Thr He Asn Leu Thr Asp Glu Arg Val Ala Gin Asp Glu Val Val Asp 
100 105 110 



384 



432 



gag gtc gtc age gac acc gec gag gca ccc aag tec gee gaa gaa aag 
Glu Val Val Ser Asp Thr Ala Glu Ala Pro Lys Ser Ala Glu Glu Lys 
115 120 125 

gcg gee gaa tec aaa ggg ccg cat aag gtt aag cac cgc act ttc cat 
Ala Ala Glu Ser Lys Gly Pro His Lys Val Lys His Arg Thr Phe His 
130 135 140 

tac ccc ggc ggg ctg ate gac ttc gtc aag cac ate aac egg acc aag 480 
Tyr Pro Gly Gly Leu He Asp Phe Val Lys His lie Asn Arg Thr Lys 
145 150 155 160 



528 
576 



so 180 185 190 

gtg cac acc ttc gec aac acc ate aac acc eac gag ggc ggc acc cac 624 



55 
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Val His Thr Phe Ala Asn Thr 
195 



10 



15 



20 



25 



30 



35 



40 



gaa gaa 
Glu Glu 
210 

aaa gac 
Lys Asp 
225 

gac att 
Asp He 



ccg cag 
Pro Gin 



aag teg 
Lys Ser 



gag gec 

Glu Ala 
290 

teg gca 
Ser Ala 
305 

cgc aag 
Arg Lys 



tge cgc 
Cys Arg 



gae teg 
Asp Ser 



ggg ttc cgc 
Gly Phe Arg 



aag aag. ctg 
Lys Lys Leu 



50 



gcg at a 
Ala He 

370 

ate gac 
lie Asp 
385 

ctg ggt 
Leu Gly 



cgc gag gac 
Arg Glu Gly 
245 

ttc gag ggc 
Phe Glu Gly 
260 

ttc gta cag 
Phe Val Gin 
275 

aac ccg teg 
Asn Pro Ser 



cag gcg cgt 
Gin Ala Arg 



age get ace 
Ser Ala Thr 
325 

tee ace gat 
Ser Thr Asp 
340 

gee ggc ggc 
Ala Gly Gly 
355 

ctt ccg ttg 
Leu Pro Leu 



age gca 
Ser Ala 
215 

etc aag 
Leu Lys 
230 

ctg gec 
Leu Ala 



lie Asn 
200 

ctg aca 
Leu Thr 



ag aag 
lu Lys 



gcg gte 
Ala Val 



Thr His Glu 



* fir til 

220 



Ser 



Gly Gly Thr His 
205 

aac aag tac gee 
Asn Lys Tyr Ala 



cag ace 
Gin Thr 



aag gtc 

Lys Val 



gaa gee 
Glu Ala 
295 

ate gec 
He Ala 
310 

gat cte 
Asp Leu 



ccg cgc 
Pro Arg 



aag ace 
Lys Thr 
265 

tge aac 
Cys Asn 
280 

aaa acc 
Lys Thr 



gee cgc 
Ala Arg 



gac gec aac 
Asp Ala Asn 
235 

ate teg gtg 
He Ser Val 
250 

aaa ctg ggt 
Lys Leu Gly 



aa cag ctg 
lu Gin Leu 



tec gec 
Ser Ala 



egg gtg ctg 
Arg Val Leu 



acc gga att 
Thr Gly He 
405 



cgc gac 
Arg Gly 
375 

aag aac 
Lys Asn 
390 

cac gac 
His Asp 



aag teg 
Lys Ser 
345 

aag age 
Lys Ser 
360 

aag ate 
Lys lie 



itc gtg aac 
al Val Asn 
300 

aag gca cga 
Lys Ala Arg 
315 

ctg ecc ggc 
Leu Pro Gly 
330 

aa ttg tat 
lu Leu Tyr 



etc acc ggc gac 
Leu Thr Gly Asp 
240 

aaa gtt gec gaa 
Lys Val Ala Glu 
255 

aac aec gag gtc 
Asn Thr Glu Val 
270 

acc cac tgg ttc 
Thr His Trp Phe 
285 

aag gcg gtc teg 
Lys Ala Val Ser 



c cgc gac 
y Arg Asp 



aec gaa 
Thr Glu 



gag ttc 
Glu Phe 



ate aac gtc 
He Asn Val 
380 

gtc cag gcg 
Val Gin Ala 
. 395 

gac etc gec 
Asp Leu Ala 
410 



gag ttg gtg cgt 
Glu Leu Val Arg 
320 

aag ctg gec gac 
Lys Leu Ala Asp 
335 

gta gtg gaa ggg 
Val Val Glu Gly 
350 

teg atg ttt cag 
Ser Met Phe Gin 
365 

gag aag gee cgc 
Glu Lys Ala Arg 



ate ate acc gcg 
He He Thr Ala 
400 

aaa ctg cgc tac 
Lys Leu Arg Tyr 
41 5 



cac aag ate gtg ctg 



672 

720 

768 

816 

864 

912 

960 

1008 

1056 

1104 

1152 

1200 

1248 

1263 
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His Lys He Val Leu 
420 



<210> 12 
<211> 421 
<212> PRT 

<213> Hycobacterium szulgai 
<400> 12 



Gly 


Glu Asn Ser Gig Tyr Asn Val Ser 


6 ]8 


Gly Leu 


His Gly 


Val 
15 


Gly 


Val 


Ser Val Val Asn Ala Leu Ser Thr 
20 25 


Arg 


Leu Glu 


Val Asp 


lie 


Lys 


Arg 


Asp Gly His Lys Trp Ser Gin Phe 
35 40 


Tyr 


Asn Lys 


Ala Val 
45 


Pro 


Gly 


Thr 


Leu Lys Gin 61 y 61 u Ala Thr Lys 
50 55 


Lys 


Thr Gly 
60 


Thr Thr 


He 


Arg 


Phe 
65 


Trp Ala Asp Pro Asp lie Phe Glu 
70 


Thr 


Thr Glu 
75 


Tyr Asp 


Phe 


Glu 
80 


Thr 


Val Ala Arg Arg; Leu Gin Glu Net 
85 


Ala 
90 


Phe Leu 


Asn Lys 


Gly 
95 


Leu 


Thr 


He Asn Leu Thr Asp Glu Arg Va1 
100 105 


Ala 


Gin Asp 


Glu Val 
110 


Val 


Asp 


Glu 


Val Val Ser Asp Thr Ala Glu Ala 
115 120 


Pro 


Lys Ser 


Ala Glu 
125 


Glu 


Lys 


Ala 


Ala Glu Ser Lys Gly Pro His Lys 
130 135 


Val 


Lys His 
140 


Arg Thr 


Phe 


His 


Tyr 
145 


Pro Gly Gly Leu He Asp Phe Val 
150 


Lys 


His lie 
155 


Asn Arg 


Thr 


Lys 
160 


Ser 


Pro He Gin Gin Ser Val Val Ala 
165 


Phe 
170 


Asp Gly 


Lys Gly 


Glu 
175 


Gly 


His 


Glu Val Glu He Ala Met Gin Trp 
180 185 


Asn 


Gly Gly 


Tyr Ser 
190 


Glu 


Ser 


Val 


His Thr Phe Ala Asn Thr lie Asn 
195 200 


Thr 


His Glu 


Gly Gly 
205 


Thr 


His 


Glu 


Glu Gly Phe Arg Ser Ala Leu Thr 
210 215 


Ser 


Val Val 
220 


Asn Lys 


Tyr 


Ala 


Lys 
225 


Asp Lys Lys Leu Leu Lys Glu Lys 
230 


Asp 


Ala Asn 
235 


Leu Thr 


Gly 


Asp 
240 


Asp 


He Arg Glu Gly Leu Ala Ala Val 


He 


Ser Val 


Lys Val 


Ala 


Glu 



26 



35 



45 



50 
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246- 




250 




255 




5 


Pro Gin 


Phe 


Glu 
260 


Gly Gin 


Thr Lys Thr 
265 


Lys Leu 


Gly Asn Thr 61 u 
270 


Val 




Lys Ser 


Phe 
275 


Val 


Gin Lys 


Val Cys Asn 
280 


Glu Gin 


Leu Thr His Trp 
285 


Phe 


10 


Glu Ala 
290 


Asn 


Pro 


Ser Glu 


Ala Lys Thr 
295 


Val 


Val 


Asn Lys Ala Val 
300 


Ser 




Ser Ala 
305 


Gin 


Ala 


Arg lie 
310 


Ala Ala Arg 


Lys 


Ala 
315 


Arg Glu leu Val 


Arg 
320 


IS 


Arg Lys 


Ser 


Ala 


Thr Asp 
325 


Leu Gly Gly 


Leu 

330 


Pro 


Gly Lys Leu Ala 
335 


Asp 




Cys Arg 


Ser 


Thr 
340 


Asp Pro 


Arg Lys Ser 
345 


Glu 


Leu 


Tyr Val Val Glu 
350 


Gly 


20 


Asp Ser 


Ala 
355 


Gly 


Gly Ser 


Ala Lys Ser 
360 


Gly 


Arg 


Asp Ser Net Phe 
365 


Gin 




Ala He 
370 


Leu 


Pro 


Leu Arg 


Gly Lys He 
375 


He 


Asn 


Val Glu Lys Ala 
380 


Arg 


25 


He Asp 
385 


Arg 


Val 


Leu Lys 
390 


Asn Thr Glu 


Val 


Gin 
395 


Ala lie He Thr 


Ala 
400 


30 


Leu Gly 


Thr 


Gly 


He His 
405 


Asp Glu Phe 


Asp 
410 


Leu 


Ala Lys Leu Arg 
415 


Tyr 


His Lys 


He 


Val 
420 


Leu 













<210> 13 
<211> 1263 
<212> DMA 

<213> Mycobacterium malmoense 



<220> 

to <221> CDS 



<222> (1)..(1263) 
<400> 13 



igc gag aac age gga tac aac gtc agt ggc ggt ttg cac ggt gtc ggc 48 

ily Glu Asn Ser Gly Tyr Asn Val Ser Gly Gly Leu His Gly Val Gly 
1 5 10 15 

gtg teg gtg gtc aac geg ttg teg ace egg etc gag gtg gat gtc gee 96 

Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Asp Val Ala 
20 25 30 

cgc gac ggc tac atg tgg tea cag ttc tac gat cac gee gag ccg gga 144 

Arg Asp Gly Tyr Met Trp Ser Gin Phe Tyr Asp His Ala Glu Pro Gly 



27 
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10 



25 



45 



50 



55 



35 40 45 

acc etc aaa cag ggc gag gec acc aag acg acg gga acc acc ate agg 
Thr Leu Lys Gin 61 y Glu Ala Thr Lys Thr Thr Gly Thr Thr He Arg 



50 55 6 



, Asp 

225 230 235 240 

gac ate egg gaa ggc ctg gca gcg gtc att tec gte aag gtc age gaa 
Asp He Arg Glu Gly Leu Ala Ala Val lie Ser Val Lys Val Ser Glu 
245 250 255 



192 



ttc tgg gec gat ccc gac ate ttc gag acc acc gag tac gac ttc gag 240 
Phe Trp Ala Asp Pro Asp He Phe Glu Thr Thr 61 u Tyr Asp Phe Glu 
65 70 75 80 



acg gtg gcg cgc cga ctg cag gaa atg gcg ttc ctg aac aag got ttg 288 
Thr Val Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu 
85 90 95 



w acg ate aac etc acc gac gag egg gtc agt gaa gag gag gtc gtc gac 336 

Thr lie Asn Leu Thr Asp Glu Arg Val Ser Glu Glu Glu Val Val Asp 
100 105 110 

gat stc gtc age aac acc gec gag gca ccc aag tec gec gta gaa aaa 384 

Asp Val Val Ser Asp Thr Ala Glu Ala Pro Lys Ser Ala Val Glu Lys 
20 115 120 125 

gcg gec gaa teg act ggc eca cac aag gtt aag cac egc acg ttc cae 

Ala Ala Glu Ser Thr Gly Pro His Lys Val Lys His Arg Thr Phe His 
130 135 140 



432 



tac ccg ggc ggc ttg gtg gac ttc gtc aag cae ate aat egg acc aag 480 
Tyr Pro Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys 
145 150 155 160 

aac ccg att cac aac age ate gtg gat ttc tec ggc aag gga ccg ggc 528 
30 Asn Pro He His Asn Ser He Val Asp Phe Ser Gly Lys Gly Pro Gly 

165 170 175 

cac gag gtc gaa ate gcg atg cag tgg aat gec ggc tac teg gag teg 576 
His Glu Val Glu He Ala Met Gin Trp Asn Ala Gly Tyr Ser Glu Ser 
35 180 185 190 

gtg cac acc ttc gec aac acc ate aac acc cac gag ggc ggc acc cac 624 
Val His Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His 
195 200 205 

40 gaa gag ggc ttc cgc age gcg ttg acg teg gtg gtc aac aaa tac gee 672 

Glu Glu Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala 
210 215 220 



aag gac cgc aaa etc ctg aag gac aaa gac ccc aac etc acc ggc gac 720 
Lys Asp Arg Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly " 



768 



ccg caa ttc gag ggc cag acc aaa acc aag ctg ggc aac acc gag gtc 816 
Pro Gin Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val 
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260 265 270 

aag teg ttc gtg cag aag'gtc tgc aac gaa cag etc acg cac tgg ttc 
Lys Ser Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe 
275 280 285 

laa gec aac ccg gcg gat gec aaa act gtt gta aae aag gcg gtt teg 
ilu Ala Asn Pro Ala Asp Ala Lys Thr Val Val Asn Lys Ala Val Ser 
290 . 295 300 

teg gee cag gee cga ate gca gcg cge aag gcg cga gaa ctg gtg cgc 
Ser Ala Gin Ala Arg lie Ala Ala Arg Lys Ala Arg Glu Leu Val Arg. 
305. 310 315 320 

cgc aag age gee acc gac etc ggt agg ctg ccg ggt aag etc gca gac 
Arg Lys Ser Ala Thr Asp Leu Gly Sly Leu Pro Gly Lys Leu Ala Asp 
• 325 330 335 



tgc cgc tec ace gac ccg cga aag teg gaa ctg tat gtg gtg gag ggt 
Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly 
340 345 350 



ctg ggc acg ggg att cac' gac gag ttc gac ate acc aag etc egg tac 
Leu Gly Thr Gly* He His Asp Glu Phe Asp He Thr Lys Leu Arg Tyr 
405 410 415 



864 
912 
960 
1008 
. 1056 



gac .teg gee ggc ggc teg gec aag age ggc cgc gac teg atg ttc cag 1104 
Asp Ser Ala Gly Gly Ser Ala Lye Ser Gly Arg Asp Ser Met Phe Gin 
355 360 365 



gcg ate etc ccg ctg cgt ggc aag ate ate aae gtc gag aag gcg cgc 1152 
Ala He Leu Pro Leu Arg Gly Lys lie He Asn Val Glu Lys Ala Arg 
370 375 380 



ate gac egg gtg ctg aag aac acc gaa gtt cag gcg ate ate acc gcg 1200 
He Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala 
385 390 395 400 



1248 



cac aag ate gtg ctg 1263 
His Lys He Val Leu 
420 

<210> 14 
<211> 421 
<212> PRT 

<213> Mycobacterium malmoense 
<400> 14 

Gl^ Glu Asn Ser Gly Tyr Asn Val Ser Gig Gly Leu His Gly Val Gly 

Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Asp Val Ala 
so 20 25 30 

Arg Asp Gly Tyr Met Trp Ser Gin Phe Tyr Asp His Ala Glu Pro Gly 



55 



29 



• 
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35- 

Thr Leu Lys 61 n 
50 

Phe Trp Ala Asp 
65 



40 



45 



Gly Slu'Ala Thr Lys Thr Thr 61 y Thr Thr He Arg 
55 60 

Pro Asp He Phe Glu Thr Thr 61 u Tyr Asp Phe 61 u 
70 75 80 



10 



Thr Val Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Sly Leu 
85 90 95 



15 



20 



Thr He Asn Leu 
100 

Asp Val Val Ser 
115 

Ala Ala Glu Ser 
130 

Tyr Pro Gly Gly 
145 



Thr Asp Glu Arg Val Ser Glu Glu Glu Val Val Asp 
105 110 

Asp Thr Ala Glu Ala Pro Lys Ser Ala Val Glu Lys 
120 125 

Thr Gly Pro His Lys Val Lys His Arg Thr Phe His 

135 - 140 . 

Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys 
150 155 160 



25 



30 



35 



Asn Pro He His Asn Ser He Val Asp Phe Ser. Gly Lys Gly Pro Gly 
165 170 175 



His Glu Val Glu 
180 

Val His Thr Phe 
195 

Glu Glu Gly Phe 
210 

Lys Asp Arg Lys 
225 



He Ala Met Gin Trp Asn Ala Gly Tyr Ser Glu Ser 
185 190 

Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His 
200 205 

Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala 
215 220 

Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp 
. 230 235 240 



Asp He Arg Glu Gly Leu Ala Ala Val lie Ser Val Lys Val Ser Glu 
245 



250 



255 



40 



45 



SO 



Pro Gin Phe Glu 
260 

Lys Ser Phe Val 
275 

Glu Ala Asn Pro 
290 

Ser Ala Gin Ala 
305 



Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val 
265 270 

Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe 
280 285 

Ala Asp Ala Lys Thr Val Val Asn Lys Ala Val Ser 
295 300 

Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg 
310 315 320 



Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp 
325 330 335 ' 



30 
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15 
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Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Vai Vai Glu Gly 
340 345 . 350 

Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin 
355 360 365 

Ala lie Leu Pro Leu Arg Gly Lys He He Asn Vai Glu Lys Ala Arg 
370 375 380 

He Asp Arg Vai Leu LysAsn Thr Glu Vai Gin Ala He He Thr Ala 
385 390 395 400 

Leu Gly Thr Gly He His Asp Glu Phe Asp He Thr Lys Leu Arg Tyr 
405 410 415 

His Lys He Vai Leu 
420 



<210> 15 
20 <211> 1263 



<212> DMA 
<213> Mycobacterium intracellulars 

<220> 

» <221> CDS 

<222> (1)..(1263) 

<400> 15 

ggt gag aac age ggt tac aac gtc age ggt ggc etg cac ggc gtg ggc 48 
Gly 61 u Asn Ser Gly Tyr Asn Vai Ser Gly Gly Leu His Gly Vai Gly 
so 1 5 10 15 

gtc teg gtg gtc aac gcg etg teg acc egg etc gag gtg gae ate gee 96 
Vai Ser Vai Vai Asn Ala Leu Ser Thr Arg Leu Glu Vai Asp He Ala 
20 25 30 

cgc gat ggc tac gaa tgg teg cag ttc tac gae cac gee gta ccc gga 144 
Arg Asp Gly Tyr Glu Trp Ser Gin Phe Tyr Asp His Ala Vai Pro Gly 
35 40 . 45 

acg etc aaa cag ggt gag gec acc aag egg acg ggc acc acg ate agg 192 
io Thr Leu Lys Gin Gly Glu Ala Thr Lys Arg Thr Gly Thr Thr He Arg 

50 55 60 

ttc tgg gec gae ccc gae ate ttc gag acc acc gag tac gae ttc gag 240 
Phe Trp Ala Asp Pro Asp He Phe Glu Thr Thr Glu Tyr Asp Phe Glu 
<5 65 70 75 80 

acg gtg gcg cgc egg etg cag gaa atg gcg ttc etc aao aag ggg ttg 268 
Thr Vai Ala Arg Arg Leu 61 n Glu Het Ala Phe Leu Asn Lys Gly Leu 
85 90 95 

50 acc ate aac etc acc gae gag egg gtg age aac gag gag gtc gtc gae 336 

Thr lie Asn Leu Thr Asp Glu Arg Vai Ser Asn Glu Glu Vai Vai Asp 
100 105 110 



ss 



31 



EP 1 098 003 A2 



10 



15 



20 



25 



30 



35 



40 



lag gtc gtc age gat acc gec gac gca ccc aag teg 
lu Val Val Ser Asp ThrAla Asp Ala Pro Lys Ser 
115 120 



gcg gcg gaa teg act gcg cca cat aag gtt aag cac 
Ala Ala Glu Ser Thr Ala Pro His Lys Val Lys His 
130 135 • HO 

tac cee ggc gat ctg gtc gac ttc gtc aag cac ate 
Tyr Pro Gly Gly Leu Val Asp Phe Val Lys His He 
145 150 155 

age ccg ate cag cag age ate ate gac ttc gac ggc 
Ser Pro He 61n Gin Ser He He Asp Phe Asp Gly 
165 170 

cac gag gtc gag ate gcg atg cag tgg aac ggc ggc 
His Glu Val Glu lie Ala Met Gin Trp Asn Gly Gly 
180 185 

gtg cac acc ttc gec aac ace ate aac ace cac gag 
Val His Thr Phe Ala Asn Thr He Asn Thr His Glu 
195 200 

gaa gag ggc ttc cgc age gcg ctg acg teg gtg gtg 
Glu Glu Gly Phe Arg Ser Ala Leu Thr Ser Val Val 
210 215 220 



gee cag gaa aag 
Ala Gin Glu Lys 
125 

cgc acc ttc cac 
Arg Thr Phe His 



aac cgc acc aag 
Asn Arg Thr Lys 
160 

aaa ggt ccc ggc 
Lys Gly Pro Gly 
175 

tac teg gaa tec 
Tyr Ser Glu Ser 
190 

ggc ggc acc cac 
Gly Gly Thr His 
205 

aac aag tac gee 
Asn Lys Tyr Ala 



aaa gac aag aag ttg ctg aaa gac aag gac ccg aac 
Lys Asp Lys Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr 
225 230 235 



etc ace ggc gac 
Gly Asp 
240 



gac att cgc gaa ggc ctg gec gcg gtg ate teg gtc 
Asp He Arg Glu Gly Leu Ala Ala Val He Ser Val 
245 250 



ccg cag ttc gag got cag ace aag acc aag ctg ggc 
Pro Gin Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly 
260 265 



50 



aag teg ttc gtg cag aag gtc tgc aac gaa cag etc 
Lys Ser Phe Val Gin Lys Val.Cys Asn Glu Gin Leu 
275 280 

gag gec aac ccc gcg gac gec aag gtg gtg gtc aac 
Glu Ala Asn Pro Ala Asp Ala Lys Val Val Val Asn 
290 295 300 

teg gcg cag gee egg ate gec gcg cgc aag gcg cga 
Ser Ala Gin Ala Arg He Ala Ala Arg Lys Ala Arg 
305 310 315 

cgc aag age gec acc gat ctg ggc ggg ctg ccc ggc 
Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly 
325 330 



aag gtc age gaa 
Lys Val Ser Glu 
255 

aac acc gaa gtg 
Asn Thr Glu Val 
270 

acc cac tgg ttc 
Thr His Trp Phe 
285 

aag gcg gtg teg 
Lys Ala Val Ser 



gag ttg gtg cgt 
Glu Leu Val Arg 
320 

aag etc gee gac 
Lys Leu Ala Asp 
335 



384 
432 
480 
528 
576 
624 
672 
720 
768 
816 
864 
912 
960 
1008 
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toe cgc teg acg gat ccg cgc aag teg gaa ctg tat gtg gtg gag ggt 1056 
Cys Arg Ser Thr Asp Pro'Arg Lys Ser 51 u Leu Tyr Val Val Glu Gly 
• 340 . 345 350 

gat teg gee ggc ggc teg gcg aag age ggc cgc gac teg atg ttc cag 1104 
Asp Ser Ala Gly Sly Ser Ala Lys Ser Sly Arg Asp Ser Met Phe Gin 
355 360 365 

gee ate ctg ccg ctg cgc ggc aag ate ate aac gtc gag aag gee cgc 1152 
Ala lie Leu Pro Leu Arg Gly Lys He He Asn Val Glu Lys Ala Arg 
370 375 380 

ate gac egg gtg ttg aag aac acc gag gtg cag gee ate ate acc gec 1200 
He Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala 
385 390 395 400 



ctg ggc acc ggc ate cac gae gag ttc gac ate acc aag ctg cgc tat 1248 
Leu Gly Thr Gly. lie His Asp Glu Phe Asp He Thr Lys Leu Arg Tyr 
405 410 415- 



cac aag ate gtg ctg 1263 
His Lys He Val Leu 
420 

<210> 16 
<211> 421 
<212> PRT 

<213> Mycobacterium intracellulars 
<400> 16 

Gly Glu Asn Ser Gly Tyr Asn Val Ser Gly Gly Leu His Gly Val Gly 
1 5 10 15 . 

Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Asp He Ala 
35 20 25 30 

Arg Asp Gly Tyr Glu Trp Ser Gin Phe Tyr Asp His Ala Val Pro Gly 
35 40 45 

40 Thr Leu Lys Gin Gly Glu Ala Thr Lys Arg Thr Gly Thr Thr lie Arg 
50 55 60 

Phe Trp Ala Asp Pro Asp He Phe Glu Thr Thr Glu Tyr Asp Phe Glu 
65 70 75 80 . 



Thr Val Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu 

85 90 95 

Thr He Asn Leu Thr Asp Glu Arg Val Ser Asn Glu Glu Val Val Asp 
100 105 110 

Glu Val Val Ser Asp Thr Ala Asp Ala Pro Lys Ser Ala Gin Glu Lys 
115 120 125 



55 
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w 



is 



20 



25 



30 



35 



40 



45 



50 



Ala Ala Glu Ser Thr Ala Pro His Lys Val Lys His Arg Thr Phe His 
130 .135 140 

Tyr Pro Gly Gly Leu Val Asp Phe Val Lys His lie Asn Arg Thr Lys 
145 150 155 160 

Ser Pro He Gin Gin Ser lie lie Asp Phe Asp Gly Lys Gly Pro Gly 
165 170 175 

His 61 u Val 61 u He Ala Met 61 n Trp Asn Gly Gly Tyr Ser Glu Ser 
180 185 190 

Val His Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Sly Thr His 
195 ZOO 205 

Glu G1u Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala 
210 215 220 

Lys Asp Lys Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp 
225 230 235 240 

Asp He Arg Glu Gly Leu Ala Ala Val He Ser Val Lys Val Ser Glu 
245 250 255. 

Pro Gin Phe G1u Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val 
260 265 270 

Lys Ser Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe 
275 280 285 

Glu Ala Asn Pro Ala Asp Ala Lys Val Val Val Asn Lys Ala Val Ser 
290 295 300 

Ser Ala G1n Ala Arg lie Ala Ala Arg Lys Ala Arg Glu Leu Val Arg 
305 310 315 320 

Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp 
325 330 335 

Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly 
340 345 350 

Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin 
355 360 355 

Ala He Leu Pro Leu Arg Gly Lys He He Asn Val Glu Lys Ala Arg 
370 375 380 

He Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala 
385 . 390 395 400 

Leu Gly Thr Gly He His Asp Glu Phe Asg lie Thr Lys Leu Arg Tyr 



405 

His Lys He Val Leu 
420 



415 
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<210> 17 

<211> 1263 
<212> DNA 

<213> Mycobacterium avium 

<220> 

<221> CDS 

<222> (1)..(1263) 

<400> 17 

ggc gag aac age ggc tac aac gtc age ggc ggt ctg cac ggc gtc ggc 48 

61 y 61 u Asn Ser Gly Tyr Asn Vai Ser Gly Gly Leu His Gly Val Gly 
1 5 10 15 

gtc teg gtg gtc aac gcg ctg tec act egg etc gag gtc aac ate gee 96 
Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Asn He Ala 
20 .25 30 



egc gac ggc tac gag tgg teg cag tac tac gac cac gee gtg ccc ggc 
Arg Asp Gly Tyr Glu Trp Ser Gin Tyr Tyr Asp His Ala Val Pro Gly 
35 40 45 



144 



acc etc aag cag ggc gag gec acc aag egc acc ggc ace acc ate egg 192 
Thr Leu Lys Gin Gly Glu Ala Thr Lys Arg Thr Gly Thr Thr He Arg 
50 55 60 



ttc tgg gec gac ccc gac ate ttc gag acc acc gag tac gac ttc gaa 240 

Phe Trp Ala Asp Pro Asp He Phe Glu Thr Thr Glu Tyr Asp Phe Glu 

65 70 75 80 

acg gtg gec egg egg ctg cag gaa atg gcg ttc etc aac aag ggc ctg 288 

Thr Val Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu 
85 90 95 

acc ate aac etc acc gac gag egg gtg acc aac gaa gag gtc gtc gac 336 

Thr lie Asn Leu Thr Asp 61u Arg Val Thr Asn Glu Glu Val Val Asp 
100 105 110 

gag gtg gtc age gac acc gee gac gca ccc aag teg gcg cag gag aag 384 

Glu Val Val Ser Asp Thr Ala Asp Ala Pro Lys Ser Ala Gin Glu Lys 
« 115 120 125 

gcg gcg gaa teg get gcg ceg cat aag gtc aag cac egc acc ttc cac 432 

Ala Ala Glu Ser Ala Ala Pro His Lys Val Lys His Arg Thr Phe His 
130 135 140 

45 

tac ccc ggc ggc ctg gtc gac ttc gtc aaa cac ate aat egc acc aaa 480 

Tyr Pro Gly Gly Leu Val Asp Phe Val Lys His lie Asn Arg Thr Lys 

145 150 155 160 



aac ccc ate cac cag age ate ate gat ttc ggt ggg aag ggc ccc gac 528 
Asn Pro He His Gin Ser He He Asp Phe Gly Gly Lys Gly Pro Gly 
165 170 175 



35 
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cac gag 

His Glu 



10 



15 



25 



30 



35 



40 



50 



gtg 
Val 



cac 
His 



gag gag 
Glu Glu 
210 

aag gac 
Lys Asp 
225 

gac ate 
Asp He 



ccg cag 
Pro Gin 



aag teg 
Lys Ser 



gaa gee 
Glu Ala 
2S0 

tea gcg 
Ser Ala 
305 

cgc aag 
Arg Lys 



gtc gag ate 
Val Glu He 
180 

ace ttc gee 
Thr Phe Ala 
195 

ggc ttc cgc 
Gly Phe Arg 



aag aag ctg 
Lys Lys Leu 



cgc gag ggt 
Arg Glu Gly 
245 

ttc gag ggc 
Phe Glu Gly 
260 

ttc gtg cag 
Phe Val Gin 
275 

aac ccc gca 
Asn Pro Ala 



gcg atg 
Ala Met 



aac acc 
Asn Thr 



age gcg 
Ser Ala 
215 

etc aag 
Leu Lys 
230 

ttg gcc 
Leu Ala 



cag tgg aac ggc 
Gin Trp Asn Gly' 
185 

ate aac acg cac 
He Asn Thr His 
200 

ctg acc tec gtg 
Leu Thr Ser Val 



ggc tac 
Gly Tyr 



cag acc 
Gin Thr 



aag gtg 
Lys Val 



cag gcg cgc 
Gin Ala Arg 



tgc egg 
Cys Arg 



gac teg 
Asp Ser 



gcc ate 
Ala He 
370 



age gca acc 
Ser Ala Thr 
325 

teg acc gac 
Ser Thr Asp 
340 

gcc ggc ggc 
Ala Gly Gly 
355 

ctt ccg ctg 
Leu Pro L : eu 



gac gcc 
Asp Ala 
295 

ate gee 
He Ala 
310 

gac ctg 
Asp Leu 



gac aag gac ccc 
Asp Lys Asp Pro 
235 

gcg gtg ate teg 
Ala Val He Ser 
250 

aag acc aaa etg 

Lys Thr Lys Leu. 
265 

tgc aac gaa cag 
s Asn Glu Gin 

10 

aaa gtc att gtc 
Lys Val He Val 



gag gge 
Glu Gly 
205 

gtc aac 
Val Asn 
220 

aac ctg 
Asn Leu 



tec gaa teg 
Ser Glu Ser 
190 

gge acc cac 
Gly Thr His 



aag tac gcc 
Lys Tyr Ala 



gtc aag 
Val Lys 



c aac 
y Asn 



ccg cgc 
Pro Arg 



teg gcg 
Ser Ala 



cgc ggc 
Arg Gly 
375 



gcg cgc aag gcg 
Ala Arg Lys Ala 

315 

gge gcg ctg ccc 
Gly Gly Leu Pro 
330 

aag teg gaa ttg 
Lys Ser Glu Leu 
345 

aaa age ggc egg 
Lys Ser Gly Arg 
360 

aag ate ate aac 
Lys He He Asn 



etc acc 
Leu Thr 
285 

aac aag 
Asn Lys 
300 

cga gag 
Arg Glu 



ace ggc gac 
Thr Gly Asp 

gtg age gaa 
Val Ser Glu 
255 

aec gag gtg 
Thr Glu Val 
270 

cac tgg ttc 
His Trp Phe 



gcg gtt tea 
Ala Val Ser 



ttg gtg cgc 
Leu Val Ari 
32 



y Lys 

tat gtg 
Tyr Val 



gac teg 
Asp Ser 
365 

gtc gaa 
Val Glu 
380 



etc gcc gac 
Leu Ala Asp 
335 

gtc gag ggt 
Val Glu Gly 
350 

atg ttc cag 
Met Phe Gin 



aag gcc cgc 
Lys Ala Arg 



ate gae egg gtg ctg aag aac acc gag gtg cag gcg ate ate acc gcg 
He Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala 
385 390 395 ' 400 



576 

624 

672 

720 

768 

•816 

864 

912 

960 

1008 

1056 

1104 

1152 

1200 
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ctg 
Leu 


ggc acc ggg att cac gac 
Gly Thr Sly He His Asp 
405 * 


gag ttc gac. ate acc aag ctg 
Glu Phe Asp He Thr Lys Leu 
410 


cgc 
Arg 
415 


tac 
Tyr 


5 


cac aag ate gta 
His Lys He VaT 
420 


ttg 
Leu 










10 


<210> 18 
<211> 421 
<212> PRT 

<213> Mycobacterium avium 




• 




15 


<400> 18 

61 y Glu Asn Ser 


Gly Tyr 
5 


Asn 


Val Ser Gly Gly Leu His Gly 
10 


Val 
15 


Gly 


20 


Val 


Ser Val Val 


Asn Ala 


Leu 


Ser Thr Ara Leu Glu Val Asn 
25" 30 


lie 


Ala 


Arg 


Asp Gly Tyr 

WW 


Glu Trp 


Ser 


Gin Tyr Tyr Asp His Ala Val 
40 45 


Pro 


Gly 


25 


Thr 


Leu Lys Gin 
50 


Gly Glu 


Ala 
55 


Thr Lvs Ara Thr Glv Thr Thr 

Jill im J *J *»l llll W I J lilt lilt 

60 


He 


Arg 




Phe 
65 


Trp Ala Asp 


Pro Asp 
70 


He 


Phe Glu Thr Thr Glu Tvr Asd 
75 


Phe 


Glu 
80 


30 


Thr 


Val Ala Arg 


Arg Leu 
85 


Gin 


Glu Met Ala Phe Leu Asn Lys 
90 


Gly 
95 


Leu 




Thr 


lie Asn Leu 
100 


Thr Asp 


Glu 


Arg Val Thr Asn Glu Glu Val 
105 110 


Val 


Asp 


35 


Glu 


Val Val Ser 
115 


Asp Thr 


Ala 


Asp Ala Pro Lys Ser Ala Gin 
120 " 125 


Glu 


Lys 


t 


Ala 


Ala 61 u Ser 
130 


Ala Ala 


Pro 
135 


His Lys Val Lys His Arg Thr 
140 


Phe 


His 


40 


Tyr 
145 


Pro Gly Gly 


Leu Val 
150 


Asp 


rne vai uys nis lie nsn nrg 
155 


inr 


Lys 
160 




Asn 


Pro He His 


Gin Ser 
165 


He 


He Asp Phe Gly Gly Lys Gly 
170 


Pro 

175 


Gly 


45 


His 


Glu Val Glu 
180 


He Ala 


Met 


Gin Trp Asn Gly Gly Tyr Ser 
185 190 


Glu 


Ser 


50 


Val 


His Thr Phe 
195 


Ala Asn 


Thr 


He Asn Thr His Glu Gly Gly 
200 205 


Thr 


His 




Glu 


Glu Gly Phe 
210 


Arg Ser 


Ala 
215 


Leu Thr Ser VaT Val Asn Lys 
220 


Tyr 


Ala 



1263 
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Lys 
225 


Asp Lys 


Lys 


Leu Leu Lys Asp Lys Asp rro Asn Leu inr uiy 
230 235 


A_*t 

Asp 
240 


5 


Asp 


He Arg 


Glu 


Gly Leu Ala Ala Val He Ser Val Lys Val Ser 
245 250 255 


Glu 




rro 


bin rne 


CI if 

260 


uiy Bin Thr Lys Thr Lys Leu Gly Asn Thr Glu 
265 270 


Val 


10 


Lys 


ser rne 
275 


val 


61 n Lys Val Cys Asn Glu Gin Leu Thr His Trp 
280 285 


Phe 


15 


Glu 


Ala Asn 
290 


Pro 


Ala Asp Ala Lys Val He Val Asn Lys Ala Val 
295 300 


Ser 




Ser 
305 


Ala Gin 


Ala 


Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val 

310 • 315 


Arg 
320 


20 


Arg 


Lys Ser 


AT « 

Ala 


Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala 
325 330 335 


Asp 




Cys 


Arg Ser 


TL _ 

Thr 
340 


Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu 
345 350 


Gly 


25 


Asp 


Ser Ala 
355 


Gly 


Gly Ser Ala Lys Ser Gly Arg. Asp Ser Met Phe 
360 365 


Gin 




Ala 


lie Leu 
370 


Pro 


Leu Arg Gly Lys He He Asn Val Glu Lys Ala 
375 380 


Arg 


30 


He 
3B5 


Asp Arg 


Val 


Leu Lys Asn Thr Glu Val Gin Ala He He Thr 
390 395 ■ 


Ala 
400 




Leu 


Gly Thr 


Gly 


He His Asp Glu Phe Asp He Thr Lys Leu Arg 
405 410 415 


Tyr 


35 


His 


Lys He 


Val 
420 


Leu 





<210> 19 
« <211> 1263 



so 



55 



<212> DNA 
<213> Mycobacterium gordonae 



<220> 

« <221> CDS 



<222> (1)..(1263) 
<400> 19 

ggc gag aac age ggc tac acg gtc age ggt ggg ttg cac ggc gtg ggc 48 

Gly Glu Asn Ser Gly Tyr Thr Val Ser Sly Gly Leu His Gly Val Gly 

1 5 10 15 

gtg teg gtg gtt aac gcg ttg teg acg egg ttg gaa gtc gac ate aaa 96 
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Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Asp He Lys 
20 25 , 30 



cgc gac ggg 
Arg Asp 61 y 
35 



cac gag 
His Glu 



tgg teg cag 
Trp Ser Sin 
40 



tat tac 
Tyr Tyr 



aag cgc gcg gtg 
Lys Arg Ala Val 
45 



ccg g^c 



Pro 



144 



10 



acc etc aag 
Thr Leu Lys 
50 



cag ggt 
Gin Gly 



ag acg acc cgc aag acc ggc acc aca 
lu Thr Thr Arg Lys Thr Gly Thr Thr 
55 60 



ate egg 192 
He Arg 



75 



ttc tgg gcg 
Phe Trp Ala 
65 

acg gtg gcg 
Thr Val Ala 



gat ccg 
Asp Pro 



cgc egg 
Arg Arg 
85 



gag ate ttc 
Glu He Phe 
70 

ctg cag gag 
Leu Gin Glu 



lag acc 
lu Thr 



atg gcg 
Net Ala 

90 



acc caa tac gac 
Thr Gin Tyr Asp 
75 

ttc ctg aac aag 
Phe Leu Asn Lys 



ttc gag 240 
Glu 



Phe 



80 



ggt ctg 288 
Gly Leu 
95 



20 



acg ate aat 
Thr He Asn 



ctg acc 
Leu Thr 
100 



gac gaa cgc 
Asp Glu Arg 



gtc gag 
Val Glu 
105 



cag gac gag gtt 
Gin Asp Glu Val 
110 



gtc gac 336 
Val Asp 



25 



gag gtc gtc 
Glu Val Val 
115 

get gee gaa 
Ala Ala Glu 
130 



age gac 
Ser Asp 



tee aag 
Ser Lys 



acc gec gaa 
Thr Ala Glu 
120 

gee eeg eae 
Ala Pro His 
135 



gcg ccc 
Ala Pro 



aag gtc 
Lys Val 



aaa tec gee gaa 
Lys Ser Ala Glu 
125 

aag cag cgc acc 
Lys Gin Arg Thr 
140 



gag aag 384 
Glu Lys 



ttc cac 432 
Phe His 



30 



tat ccc ggt 
Tyr Pro Gly 
145 



ggt ctg 
Gly Leu 



gtc gac ttc 
Val Asp Phe 
150 



gtc aaa 
Val Lys 



cac ate aac cgc 
His He Asn Arg 
155 



acc aaa 480 
Thr Lys 
160 



age ccg ate 
Ser Pro lie 



cag cag 
Gin Gin 
165 



age gtc ate 
Ser Val He 



gac ttc 
Asp Phe 
170 



gaa ggc aaa ggc 
Glu Gly Lys Gly 



acc ggc 
Thr Gly 
175 



528 



40 



cac gag gtc 
His Glu Val 



gtg 
Val 



cac acc 
His Thr 
195 



gaa ate 
Glu He 
180 

ttc gee 
Phe Ala 



gcg atg cag 
Ala Met Gin 



aac acc ate 
Asn Thr He 
200 



tgg aac 
Trp Asn 
185 

aae acc 

Asn Thr 



GIG 



cac gag 
His Glu 



tac tec 
Tyr Ser 
190 

ggc ggc 
Gly Gly 
205 



gaa teg 576 
Glu Ser 



acc cac 624 
Thr His 



45 



aa gag ggc 
lu Glu Gly 
210 . 



ttc cgc 
Phe Arg 



agt gcg ctg 
Ser Ala Leu 
215 



acc teg 
Thr Ser 



gtg gtc 
Val Val 
220 



aac aag 
Asn Lys 



tac gec 672 
Tyr Ala 



50 



aaa gac aag 

Lys Asp Lys 
225 



aag ctg 
Lys Leu 



etc aag gag 
Leu Lys Glu 
230 



aag gae 

Lys Asp 



ccg aat 
Pro Asn 
235 



etc acc 
Leu Thr 



got gac 720 
Sly Asp 
240 



gac ate egg gag ggg ttg gee gcg gtg ate teg gtg aag gtc gec gaa 768 



55 
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Asp He Arg Glu Gly Leu Ala Ala Val He Ser Val Lys Val Ala Glu 
245 . . 25.0 255 

ccg cag ttc gag ggt cag acc aag acc aag ctg ggc aac acc gag gtc 
Pro Gin Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val 
260 265 270 

aag teg ttc gtg cag aag gtg tgc aac gaa cag etc acc cac tgg ttc 
Lys Ser Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe 
275 280 285 

gag gee aat ccg teg gaa get aaa acc gtt gtg aac aaa gcg gtg teg 
Glu Ala Asn Pro Ser Glu Ala Lys Thr Val Val Asn Lys Ala Val Ser 
290 295 300 

tec gee cag gcg egg ate gee gcg cgc aaa gcg cga gag ctg gtg cgc 
Ser Ala Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg 
305 310 315 320 

cgc aag age gca acc gac etc ggc ggc ctg ccg age aag etc gee gac 
Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp 
325 330 335 

tgc cgt teg acg gat ccc cgc aaa tec gaa ctg tat gtg gtg gag ggg 
Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly 
340 345 350 

gac tec gee ggc ggc teg gee aag age gat egg gat teg atg ttc cag 
Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin 
355 360 365 

gcg att ett ccg ttg cgc ggc aag ate ate aac gtc gag aag gee cgc 
Ala lie Leu Pro Leu Arg Gly Lys lie He Asn Val Glu Lys Ala Arg 
370 375 380 

ate gac egg gtg ctg aag aac acc gaa gtc cag gee ate ate acc gcg 
lie Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala 
385 390 395 400 

ctg ggc acc ggg ate cac gac gag ttc gac ate acc aaa ctg cgc tac 
Leu Gly Thr Gly He His Asp Glu Phe Asp He Thr Lys Leu Arg Tyr 
405 410 41.5 

cac aag ate gta ttg 
His Lys He Val Leu 
420 



<210> 20 
<211> 421 
<212> PRT 

<213> Mycobacterium gordonae 
<400> 20 

Gly Glu Asn Ser Gly Tyr Thr Val Ser Gly Gly Leu His Gly Val Gly 
15 10 15 



40 
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Val Ser Val Val Asn Ala Leu Ser 
20 


Thr 
25 


Arg Leu 


Glu 


Val 


Asp 
30 


He Lys 


Ara Asp Gly His 61 u Trp Ser Gin 


Tyr 


Tyr Lys 


Arg 


Ala 
45 


Val 


Pro Gly 


Thr Leu Lys Gin Gly Glu Thr Thr 
50 55 


Arg 


Lys Thr 


Gly 
60 


Thr 


Thr 


He Arg 


Phe Trp Ala Asp Pro Glu He Phe 
65 70 


Glu 


Thr Thr 
75 


Gin 


Tyr 


Asp 


Phe Glu 
80 


Thr Val Ala Arg Arg Leu 61 n Glu 
85 


net 


Aia 
90 


rna 


Leu 


Asn 


Lys 


Gly Leu 
95 


Thr He Asn Leu Thr Asp 61 u Arg 
100 


Val 
105 


o 1 U 


Gin 

u in 


Asp 


Glu 


Val 
110 


Val Asp 


Glu Val Val Ser Asp Thr Ala Glu 
115 120 


Ala 


Pro 


Lys 


Ser 


Ala 
125 


Glu 


Glu Lys 


Ala Ala Glu Ser Lys Ala Pro His 
130 135 


Lys 


Val 


Lys 


Gin 
140 


Arg 


Thr 


Phe His 


Tyr Pro Gly Gly Leu Val Asp Phe 
145 150 


Val 




His 

n id 

155 


lie 


Asn 


Arg 


Thr Lys 
160 


Ser Pro lie Bin Bin ser Val lie 
165 


ASp 


Phe 
170 


Glu 


uly 


Lys 


Gly 


inr oiy 
175 


His Glu Val Glu He Ala Met Gin 
180 


Trp 
185 


Asn 


Gly 


Gly 


Tyr 


Ser 
190 


Glu Ser 


Val His Thr Phe Ala Asn Thr He 
195 200 


Asn 


Thr 


His 


Glu 


Gly 
205 


Gly 


Thr His 


ftlii Hill 151 1/ Dk A Aca (a ^ Al » 1 an 

UIU ulll oiy rne Ary «er Ala L6U 
210 215 


inr 


Ser 


Val 


Vol 

220 


ten 

Asn 


Lys 


iyr Mia 


Lys Asp Lys Lys Leu Leu Lys Glu 
225 230 


Lys 


Asp 


Pro 
235 


Asn 


Leu 


Thr 


Gly Asp 
240 


Asp He Arg Glu Gly Leu Ala Ala 
245 


Val 


He 
250 


Ser 


Val 


Lys 


Val 


Ala Glu 
255 


Pro Gin Phe Glu Gly Gin Thr Lys 
260 


Thr 
265 


Lys 


Leu 


Gly 


Asn 


Thr 
270 


Glu Val 


Lys Ser Phe Val Gin Lys Val Cys 
275 280 


Asn 


Glu 


Gin 


Leu 


Thr 
285 


His 


Trp Phe 


Glu Ala Asn Pro Ser Glu Ala Lys 
290 295 


Thr 


Val 


Val 


Asn 
300 


Lys 


Ala 


Val Ser 


Ser Ala Gin Ala Arg He Ala Ala 


Ar9 


Lys 


Ala 


Arg 


Glu 


Leu 


Val Arg 
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35 



40 



50 
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305 310 315 320 

• Arg Lys Ser Ala Thr Asp" Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp 
325 330 335 

Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly 
340 345 350 

Asp Ser Ala Gly 61 y Ser Ala Lys Ser Gly Arg Asp Ser Met PhB Gin 
355 360 365 

Ala He Leu Pro Leu Arg Gly Lys He He Asn Val Glu Lys Ala Arg 
370 375 380 

is He Asp Arg Val Leu Lys Asn Thr 61 u Val Gin Ala He He Thr Ala 

385 390 395 400 

Leu Gly thr Gly He His Asp Glu Phe Asp He Thr Lys Leu Arg Tyr 
405 410 415 

20 His Lys He Val Leu 

420 

<210> 21 
25 <211> 1257 

<212> ONA 

<213> Mycobacterium africanum 

<220> 
<221> COS 
30 <222> CD. .(1257) 

<400> 21 

teg gac gcg tat gcg ata tct ggt ggt ctg cac ggc gtc ggc gtg teg 48 
Ser Asp Ala Tyr Ala He Ser Gly Gly Leu His Gly Val Gly Val Ser 
15 10 15 

gtg gtt aac gcg eta tec acc egg etc gaa gtc gag ate aag cgc gac 96 
Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Glu lie Lys Arg Asp 
20 25 30 



ggg tac gag tgg tct cag gtt tat gag aag teg gaa ccc ctg ggc etc 144 
Gly Tyr Glu Trp Ser Gin Val Tyr Glu Lys Ser Glu Pro Leu Gly Leu 
35 40 45 

aag caa ggg gcg ccg acc aag aag acg ggg tea acg gtg egg ttc tgg 192 
Lys Gin Gly Ala Pro Thr Lys Lys Thr Gly Ser Thr Val Arg Phe Trp 
50 55 60 

gee gac ccc get gtt ttc gaa acc acg gaa tac gac ttc gaa acc gtc 240 
Ala Asp Pro Ala Val Phe Glu Thr Thr Glu Tyr Asp Phe Glu Thr Val 
65 70 75 80 

gec cgc egg ctg caa gag atg gcg ttc etc aac aag ggg ctg acc ate 288 
Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu Thr He 
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85 



90 



10 



15 



20 



30 



40 



45 



50 



aac ctg acc 
Asn Leu Thr 



gtc age gac 
Val Ser Asp 
115 

gaa tec act 
Glu Ser Thr 
130 

ggt ggc ctg 
Gly 61 y Leu 
145 . 

att cat age 
lie His Ser 



gtg gag ate 
Val Glu He 



acc ttc gec 
Thr Phe Ala 
195 

ggc ttc cgc 
Gly Phe Arg 
210 

cgc aag eta 
Arg Lys Leu 
225 

egg gaa gac 
Arg Glu Gly 



gac gag agg gtg acc caa gac 
' 1 Thr Gin Asp 



Asp Glu Arg Va 
100 



105 



ag gtc 
lu Val 



gtc gec gag gcg ccg aag teg 
Val Ala Glu Ala Pro Lys Ser 
120 

gea ccg cac aaa gtt aag age 
Ala Pro His Lys Val Lys Ser 
135 

9tg gac ttc gtg aaa cac ate 
Val Asp Phe Val Lys His He 
150 

age ate gtg gac ttt tec ggc 
Ser He Val Asp Phe Ser Gly 
165 170 



gca agt 
Ala Ser 



cgc acc 
Arg Thr 
140 

aac cgc 
Asn Arg 
155 

aag ggc 
Lys Gly 



gcg atg caa tgg aac gec ggg 
Ala Met Gin Trp Asn Ala Gly 
180 185 

aac acc ate aac acc cac gag 
Asn Thr He Asn Thr His Glu 
200 

age gcg ctg acg teg gtg gtg 
Ser Ala Leu Thr Ser Val Val 
215 

ctg aag gae aag gac ccc aac 
Leu Lys Asp Lys Asp Pro Asn 
230 

ctg gec get gtg ate teg gtg 
Leu Ala Ala Val He Ser Val 
245 250 



ttc gag ggc cag acc aag acc aag ttg ggc 
Phe Glu Gly Sin Thr Lys Thr Lys Leu Gly 
250 265 

ttt gtg cag aag gtc tgt aac gaa cag ctg 
Phe Val Gin Lys Val Cys Asn Glu Gin Leu 
275 280 

aac ccc acc gac teg aaa gtc gtt gtg aac 
Asn Pro Thr Asp Ser Lys Val Val Val Asn 
290 295 

caa gec cgt ate gcg gca cgt aag gca cga 
Gin Ala Arg He Ala Ala Arg Lys Ala Arg 



tat teg 
Tyr Ser 



?c ggc 
1y Gly 



aac aag 
Asn Lys 
220 

etc acc 
Leu Thr 
235 

aag gtc 
Lys Val 



aac acc 
Asn Thr 



acc cac 
Thr His 



aag get 
Lys Ala 
300 

gag ttg 

Glu Leu 



95 

gtc gac gaa gtg 336 
Val Asp Glu Val 
110 

gaa cgc gca gec 384 
Glu Arg Ala Ala 
125 

ttt eae tat ccg 432 
Phe His Tyr Pro 



acc aag aac gcg 480 
Thr Lys Asn Ala 
160 

acc ggg cac gag 528 
Thr Gly His Glu 
175 

gag teg gtg cac • 576 
Glu Ser Val His 
190 

acc cac gaa gag 624 
Thr His Glu Glu 
205 

tac gec aag gac 672 
Tyr Ala Lys Asp 



ggt gac gat ate 720 

Gly Asp Asp He 
240 

age gaa ccg cag 768 

Ser Glu Pro Gin 
255 

gag gtc aaa teg 816 

Glu Val Lys Ser 
270 

tgg ttt gaa gec 864 

Trp Phe Glu Ala 
285 

gtg tec teg gcg 912 

Val Ser Ser Ala 



gtg egg cgt aag 950 
Val Arg Arg Lys 
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305 . 310 






315 


320 




5 


age gec acc gac ate ggf gga ttg 
Ser Ala Thr Asp He Gly Gly Leu 
325 


ccc 
Pro 


9fc 
Sly 
330 


aag ctg gec gat tgc 
Lys Leu Ala Asp Cys 
'335 


cgt 
Arg 


1008 


10 


tee acg gat ccg cge aag tee gaa 
Ser Thr Asp Pro Arg Lys Ser 61u 
340 


ctg 
Leu 
345 


tat 

Tyr 


gtc gta gaa gat gac 
Val Val 61 u Gly Asp 
350 


teg 
Ser 


1056 




gee gge ggt tet gca aaa age ggt 
Ala Gly 61 y Ser Ala Lys Ser Gly 
355 360 


cgc 
Arg 


gat 
Asp 


teg atg ttc cag gcg 
Ser Met Phe Gin Ala 
365 


ata 
He 


1104 


15 


cul ccg cxg cgc gge aag axe axe 
Leu Pro Leu Arg Gly Lys He He 
370 375 


aax 
Asn 


gtg 
Val 


gag aaa gcg cgc ate 
Glu Lys Ala Arg lie 
380 


gac 
Asp 


1 1 o 

11 92 


20 


egg gta eta aag aac acc gaa gtt 
Arg Val Leu Lys Asn Thr" Glu Val 
365 390 


cag 
Gin 


R 


ate ate acg gcg ctg 
He lie Thr Ala Leu 
395 


K 

400 


1200 




ace gsg ate cac gac gag ttc gat 
Thr Gly lie His Asp Glu Phe Asp 
405 


ate 
He 


m 

410 


aag .ctg cgc tac cac 
Lys Leu Arg Tyr His 
415 


aag 
Lys 


1248 


25 


ate gtg ctg 
He Val Leu 










1257 


30 


<210> 22 
<211> 419 
<212> PRT 

<213> Mycobacterium africanum 












35 


<400> 22 

Ser Asp Ala Tyr Ala He Ser Gly 
1 5 


Gly 


Leu 
10 


His Gly Val Gly Val 
15 


Ser 






Val Val Ac;n Ata 1 pii Spr Thr Aro 
veil vai n»n Ata lcu oci inr nrg 

20 


1 All 

Leu 
25 


Gin 


vai oiu lie Lys nrg 
30 


Asp 




40 


Gly Tyr Glu Trp Ser Gin Val Tyr 
35 40 


Glu 


Lys 


Ser Glu Pro Leu Gly 
45 


Leu 






Lys Gin Gly Ala Pro Thr Lys Lys 
50 55 


Thr 


Gly 


Ser Thr Val Arg Phe 
60 


Trp 




45 


Ala Asp Pro Ala Val Phe Glu Thr 
65 70 


Thr 


Glu 


Tyr Asp Phe Glu Thr 

75 • 


Val 
80 




50 


Ala Arg Arg Leu Gin Glu Met Ala 
85 


Phe 


Leu 
90 


Asn Lys Gly Leu Thr 
95 


He 






Asn Leu Thr Asp Glu Arg Val Thr 
100 


Gin 
105 


Asp 


Gly Val Val Asp Glu 
110 


Val 





55 
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Val Ser Asp Val Ala Glu Ala Pro Lys Ser Ala Ser 61u Arg Ala Ala 
115 120 125 

Glu Ser Thr Ala Pro His Lys Val Lys Ser Arg Thr Phe His Tyr Pro 
130 135 140 

Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys Asn Ala 



Ely 
145 



150 



155 



160 



He His Ser Ser lie Val Asp Phe Ser Gly Lys Sly Thr Gly His Glu 



165 



175 



Val Glu He Ala Met Gin Trp Asn Ala Gly Tyr Ser Glu Ser Val His 
180 185 190 

Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His Glu Glu 
195 200 205 

Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala Lys Asp 
210 215 220 

Arg Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp Asp He 
225 230 235 240 

Arg Glu Gly Leu Ala Ala Val He Ser Val Lys Val Ser 61 u Pro Gin 
245 250 255 

Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val Lys Ser 
260 265 270 

Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe Glu Ala 
275 280 285 

Asn Pro Thr Asp Ser Lys Val Val Val Asn Lys Ala Val Ser Ser Ala 
290 295 300 

Gin Ala Arg lie Ala Ala Arg Lys Ala Arg Glu Leu Val Arg Arg Lys 
305 310 315 320 

Ser Ala Thr Asp He Gly Gly Leu Pro Gly Lys Leu Ala Asp Cys Arg 
325 330 335 

Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly Asp Ser 
340 345 350 

Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin Ala He 
355 360 365 

Leu Pro Leu Arg 61 y Lys lie He Asn Val Glu Lys Ala Arg He Asp 
370 375 380 

Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala Leu Gly 
385 390 395 400 

Thr Gly He His Asp Glu Phe Asp He Gly Lys Leu Arg Tyr His Lys 



45 



15 



20 



35 



40 



45 



50 



EP 1 098 003 A2 
405 410 415 



He Val Leu 



<210> 23 

<211> 1257 

<212> DNA 

<21 3> Mycobacteri urn tubercul osi s 

<220> 

<221> CDS 

<222> (1). .(1257) 

<400> 23 

teg gac gcg tat gcg ata tct ggt ggt ctg cac ggc gtc ggc gtg teg 48 
Ser Asp Ala Tyr Ala He Ser Gly Gly Leu His Gly Val Gly Val Ser 
1-5 10 15 

gtg gtt aac gcg eta tec acc egg etc gaa gtc gag ate aag cgc gac 96 
Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Glu He Lys Arg Asp 
20 25 30 

ggg tac gag tgg tct cag gtt tat gag aag teg gaa ccc ctg ggc etc 144 
Gly Tyr 61 u Trp Ser Gin Val Tyr 61u Lys Ser Glu Pro Leu Gly Leu 
35 40 45 



aag caa ggg gcg ccg acc aag aag acg ggg tea acg gtg egg ttc tgg 
Lys Gin Gly Ala Pro Thr Lys Lys Thr Gly Ser Thr Val Arg Phe Trp 
30 50 55 60 



192 



gee gac ccc get gtt ttc gaa acc acg gaa tac gac ttc gaa acc gtc 240 
Ala Asp Pro Ala Val Phe Glu Thr Thr Glu Tyr Asp Phe Glu Thr Val 
65 70 75 80 



gee cgc egg ctg caa gag atg gcg ttc etc aac aag ggg ctg acc ate 288 
Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu Thr He 
85 90 95 

aac ctg acc gac gag agg gtg acc caa gac gag gtc gtc gac gaa gtg 336 
Asn Leu Thr Asp Glu Arg Val Thr Gin Asp Glu Val Val Asp Glu Val 
100 105 110 

gtc age gac gtc gcc gag gcg ccg aag teg gca agt gaa cgc gca gcc 384 
Val Ser Asp Val Ala Glu Ala Pro Lys Ser Ala Ser Glu Arg Ala Ala 
115 120 125 

gaa tec act gca ccg cac aaa gtt aag age cgc acc ttt cac tat ccg 432 
Glu Ser Thr Ala Pro His Lys Val Lys Ser Arg Thr Phe His. Tyr Pro 
130 135 140 

ggt ggc ctg gtg gac ttc gtg aaa cac ate aac cgc acc aag aac gcg 480 
Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys Asn Ala 
145 150 155 160 



46 
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528 



576 



624 



10 



15 



20 



25 



30 



35 



40 



45 



50 



age tte cgc age gcg etg aeg teg gtg gtg aac aag tac gee aag gae 672 
Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala Lys Asp 
210 215 220 

cgc aag eta etg aag gae aag gac ccc aac etc acc ggt gae gat ate 720 
Arg Lys Leu Leu Lys Asp. Lys Asp Pro Asn Leu Thr Gly Asp Asp lie 



ttt gtg cag aag gtc tgt aac gaa cag etg acc cac tgg ttt gaa gee 864 
Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe Glu Ala 
275 280 285 

aac ccc acc gac gcg aaa gtc gtt gtg aac aag get gtg tec teg gcg 912 
Asn Pro Thr Asp Ala Lys Val Val Val Asn Lys Ala Val Ser Ser Ala 
290 295 300 

caa gec cgt ate gcg gca cgt aag gca ega gag ttg gtg egg egt aag 960 
Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg Arg Lys 
305 310 315 320 

age gec acc gac ate ggt gga ttg ccc ggc aag etg gec gat tge cgt 1008 
Ser Ala Thr Asp He Gly Gly Leu Pro Gly Lys Leu Ala Asp Cys Arg 
325 330 335 

tec acg gat ccg cgc aag tec gaa etg tat gtc gta gaa ggt gac teg 1056 
Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly Asp Ser 
340 345 350 

gec ggc ggt tct gca aaa age ggt cgc gat teg atg ttc cag gcg ata 1104 
•Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin Ala He 
355 360 365 

ctt ccg etg cgc ggc aag ate ate aat gtg gag aaa gcg cgc ate gac 1152 
Leu Pro Leu Arg Gly Lys lie He Asn Val Glu Lys Ala Arg He Asp 
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egg gtg eta aag aac ace gaa gtt cag gcg ate ate acg gcg ctg ggc 1200 
Are Val Leu Lys Asn Thr G1u Val Gin Ala He He Thr Ala Leu Gly 
385 390"-:. 395 400 



acc ggg ate cae gac gag ttc gat ate ggc aag ctg cgc tac cac aag 1248 
Thr Gly He His Asp Glu Phe Asp He Gly Lys Leu Arg Tyr His Lys 
405 410 415 



ate gtg ctg 1257 
He Val Leu 

<210> 24 
<211> 419 
<212> PRT 

<213> Mycobacterium tuberculosis 
<400> 24 

Ser Asp Ala Tyr -Ala He Ser Gly Gly Leu His Gly Val Gly Val Ser 
15 10 15 

Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Glu He Lys Arg Asp 
20 25 30 

Gly Tyr Glu Trp Ser Gin Val Tyr Glu Lys Ser Glu Pro Leu Gly Leu 
35 40 45 

Lys Gin Gly Ala Pro Thr Lys Lys Thr Gly Ser Thr Val Arg Phe Trp 
50 55 60 

Ala Asp Pro Ala Val Phe Glu Thr Thr Glu Tyr Asp Phe Glu Thr Val 
65 70 75 80 

Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lye Gly Leu Thr He 
85 . 90 95 

Asn Leu Thr Asp G1u Arg Val Thr Gin Asp Glu Val Val Asp Glu Val 
100 105 110 

Val Ser Asp Val Ala Glu Ala Pro Lys Ser Ala Ser Glu Arg Ala Ala 
115 120 125 

Glu Ser Thr Ala Pro His Lys Val Lys Ser Arg Thr Phe His Tyr Pro 
130 135 140 

Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys Asn Ala 
145 150 155 160 

He His Ser Ser He Val Asp Phe Ser Gly Lys Gly Thr Gly His Glu 
165 170 . 175. 

Val Glu He Ala Met Gin Trp Asn Ala Gly Tyr Ser Glu Ser Val His 
180 185 190 

Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His Glu Glu 
195 200 205 
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Gly 


Phe Arg 
210 


Ser 


Al a Leu 


Thr Ser Val 
215 


Val 


Asn 


Lys Tyr Ala 
220 


Lys Asp 


5 


Arg 
225 


Lys Leu 


Leu 


Lys Asp 
230 


Lys Asp Pro 


Asn 


Leu 
235 


Thr Gly Asp 


Asp He. 
240 




Arc 


Glu Gly 


Leu 


Ala Ala 

245 . 


Val He Ser 


Val 
250 


Lys 


Val Ser Glu 


Pro Gin 
255 


10 


Phe 


61u Sly 


Gin 
260 


Thr Lys 


Thr Lys Leu 
265 


Gly 


Asn 


Thr Glu Val 
270 


Lys Ser 


15 


rue 


Val Gin 
275 


Lvs 


Val Cvs 


Asn Glu Gin 
280 


Leu 


Thr 


His Trp Phe 
285 


Glu Ala 


noil 


Pro Thr 

rl u I in 

290 


Acq 
nop 


Ala Lvs 


Val Val Val 

■HI Ittl TO! 

295 


Asn 


Lvs 


Ala Val Ser 

300 


Ser Ala 


20 


Gin 

Will 

305 


Al a Aro 


He 


Ala Ala 
310 


Ara Lvs Ala 


Arg 


Glu 
315 


Leu Val Arq 


Arg Lys 
320 






Ala Thr 


Asd 


He Gly 
325 


Glv Leu Pro 


Gly 
330 


Lvs 


Leu Ala Asd 


Cys Arg 
335 


25 


Ser 


Thr Asp 


Pro 
340 


Arg Lys 


Ser Glu Leu 
345 


Tyr 


Val 


Val Glu Gly 
350 


Asp Ser 




Ala 


Gly Gly 
355 


Ser 


Ala Lys 


Ser Gly Arg 
360 


Asp 


Ser 


Met Phe Gin 
365 


Ala lie 


30 


Leu 


Pro Leu 
370 


Arg 


Gly Lys 


He He Asn 
375 


Val 


Glu 


Lys Ala Arg 
380 


He Asp 




Arg 
385 


Val Leu 


Lys 


Asn Thr 
390 


Glu Val Gin 


Ala 


lie 
395 


He Thr Ala 


Leu Gly 
400 


35 


Thr 
He 


Gly He 
Val Leu 


His 


Asp Glu 
405 


Phe Asp He 


Gly 
410 


Lys 


Leu Arg Tyr 


His Lys 
415 



40 



<210> 25 

<211> 1257 

<212> DNA 

45 <213> Mycobacterium gastri 

<220> • . 

<221> COS 

<222> (1)..(1257) 



50 <400> 25 

tec gac l . . . — - . _ . 

Ser Asp Ala Tyr Ala lie Ser Gly Gly Leu His Gly Val Gly Val Ser 



tec gac gec tat gcg ata teg ggt gga ctg cac ggt gtg ggt gtc teg 48 
- " " ' 7|y Gly Leu His Gly Val Gly Val " 
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10 



15 



20 



25 



30 



35 



gtg gtc 
Val Val 



ggc cat 
Gly His 



aac gcg ctg 
Asn Ala Leu 
20 

iag tgg teg 
lu Trp Ser 
35 



aag caa ggc gcg ccg 
Lys Gin Gly Ala Pro 
50 



gec gac 
Ala Asp 
65 

gcg cga 
Ala Arg 



aac ctg 
Asn Leu 



gtc age 
Val Ser 



gaa ttc 
Glu Phe 
130 

ggt ggc 
8Jj Gly 



cec aac gtt 
Pro Asn Val 



egg ttg cag 
Arg Leu Gin 
85 

acc gat cag 
Thr Asp Gin 
100 

gac gtc gee 
Asp Val Ala 
115 

acc gec ccc 
Thr Ala Pro 



tec* ate egg 
Ser He Arg 



caa gtt tat 
Gin Val Tyr 
40 

acg aag aag 
Thr Lys Lys 
55 

ttt gaa acc 
Phe Glu Thr 
70 

gag.atg gcg 
Glu Net Ala 



10 

ctg gag 
Leu Glu 
25 

gag aag- 
Glu Lys 



gtg gag ate 
Val Glu lie 



acc ggc 
Thr Gly 



acc gag 

Thr Glu 



egg gta acc 
Arg Val Thr 



ttg gtt gac 
Leu Val Asp 



ate cac 
He His 



age age ate 
Ser Ser He 
165 



gtg gag ate gcg atg 
Val Glu He Ala Met 
180 



acc ttc 
Thr Phe 



ggg ttc 
Gly Phe 
210 

cgc aaa 
Arg Lys 



gec aac acc 
Ala Asn Thr 
195 

cgc age gcg 
Arg Ser Ala 



ctg etc aaa 
Leu Leu Lys 



ag gec ccg 
lu Ala Pro 
120 

cac aag gtg 
His Lys Val 
135 

ttc gtc aag 
Phe Val Lys 
150 

gtc gac ttc 
Val Asp Phe 



cag tgg aat 
Gin Trp Asn 



ate aac acc 
He Asn Thr 
200 

etc acg tec 
Leu Thr Ser 
215 

gac aag gac 
Asp Lys Asp 



ttt etc 
Phe Leu 
90 

cag gac 
Gin Asp 
105 

aag teg 
Lys Ser 



tec gag ccg 
Ser Glu Pro 
45 

acg acg gtg 
Thr Thr Val 
60 

tac gac ttc 
Tyr Asp Phe 

aac aag gag 
Asn" Lys Gly 



15 

aag cgc gac 96 
Lys Arg Asp 
30 

atg gga etc 144 
Met Gly Leu 



egg ttc tgg 192 
Arg Phe Trp 



gaa gtg gtc 
Glu Val Val 



aag aag 
Lys Lys 



cac ate 
His He 



tec gga 
Ser Gly 
170 

gee ggc 
Ala Gly 
185 

cat gag 
His Glu 



gec agt gag 
Ala Ser Glu 
125 

cgt acc ttt 
Arg Thr Phe 
140 

aac cgc acc 
Asn Arg Thr 
155 

aag ggg acc 

Lys Gly Thr 



gaa acc gtc 240 
Glu Thr Val 
80 

•etc acc ate 288 
Leu Thr lie 
95 

gac gag gtg 336 
Asp Glu Val 
110 

aag gcg gee 384 
Lys Ala Ala 



cac tat ccc 432 
His Tyr Pro 



aag aac gee 480 
Lys Asn Ala 
160 



?c cac gaa 528 
y His Glu 
175 



tat teg gag 
Tyr Ser Glu 



gtg gtg 
Val Val 



ccc aac 
Pro Asn 



ggc ggg acc 
Gly Gly Thr 
205 

aac aag tac 

Asn Lys Tyr 
220 

etc acc ggc 
Leu Thr Gly 



teg gtg cac 576 
Ser Val His 
190 

cat gaa gaa 624 
His Glu Glu 



gec aag gac 672 
Ala Lys Asp 



gac gac ate 720 
Asp Asp He 



50 
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225 230 235 240 

egg gaa ggg ttg gec gcg" gtg att teg gtc aaa gtc age gaa ceg cag 
Arg Glu Gly Leu Ala Ala Val lie Ser Val Lys Val Ser Glu Pro Gin 
245 250 255 



<210> 26 
<211> 419 
<212> PRT 

50 <213> Mycobacterium gastri 

<400> 26 



55 



768 



ttc gag ggc cag ace aag acg aaa eta ggc aac acc gag gtg aag teg B16 
Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val Lys Ser 
260 265 270 



ttc gtg cag aag gtg tgc aat gaa cag etc ace cat tgg ttc gag gec 864 

Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe Glu Ala • 

275 280 285 

aac ccc get gat get aaa acc gtt gtc aac aag gca gtt tea teg gcg 912 

Asn Pro Ala Asp Ala Lys Thr Val Val Asn Lys Ala Val Ser Ser Ala 

290 295 300 

cag gec agg att gcg gec cgc aag gcg cgc gag ttg gtg cgc cgc aag 960 

Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg Arg Lys 

20 305 310 315 320 . 



age gca acc- gat ctg ggc gaa eta ccg ggc aag ttg gec gae tgc cgc 1008 
Ser Ala Thr Asp Leu Gly Gly Leu Pro 61 y Lys Leu Ala Asp Cys Arg 
325 330 335 

teg acc gac ccc cgt aag tec gaa tta tat gtg gtg gag got gat .tea 1 056 
Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly Asp Ser 
340 345 350 



sec ggc ggc teg gcg aag age ggc cgc gac teg atg ttt caa gcg ate 1104 
so Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin Ala He 

355 360 365 

R 



ttg ccg ttg cgc ggc aag ate ate aac gtc gag aag gec cgc ate gac 1152 
Lau Pro Leu Arg Gly Lys lie He Asn Val Glu Lys Ala Arg He Asp 
370 375 380 



egg gtg ctg aag aac acc gaa gtc cag gcg ate ate acc gcg ttg ggc 1200 
Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala Leu Gly 
385 390 395 ' 400 

40 acc got att cac gac gaa ttc gac ate gcg aga ctg cgt tac cac aag 1248 

Thr Gly He His Asp Glu Phe Asp He Ala Arg Leu Arg Tyr His Lys 
405 410 415 



ate gtg ctg 1257 
He Val Leu 
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Ser Asp Ala Tyr Ala lie Ser Gly Gly Leu His Gly Val Gly Val Ser 
1 5 . ., 10 15 

Val Val Asn Ala Leu Ser He Arg Leu Glu Val Glu lie Lys Arg Asp 
20 25 30 

Gly His Glu Trp Ser Gin Val Tyr Glu Lys Ser Glu Pro Met Gly Leu 
35 40 45 

Lys Gin Gly Ala Pro Thr Lys Lys Thr Gly Thr Thr Val Arg Phe Trp 
50 55 60 

Ala Asp Pro Asn Val Phe Glu Thr Thr Glu Tyr Asp Phe Glu Thr Val 
65 70 75 80 

Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu Thr lie 
85 90 95 

Asn Leu Thr Asp Gin Arg Val Thr Gin Asp Glu Val Val Asp G1u Val 
100 105 110 

Val Ser Asp Val Ala Glu Ala Pro Lys Ser Ala Ser Glu Lys Ala Ala 
115 120 125 

Glu Phe Thr Ala Pro His Lys Val Lys Lys Arg Thr Phe His Tyr Pro 
130 135 140 

Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys Asn Ala 
145 150 155 160 

He His Ser Ser Jle Val Asp Phe Ser Gly Lys Gly Thr Gly His Glu 



165 



175 



Val Glu He Ala Met Gin Trp Asn Ala Gly Tyr Ser Glu Ser Val His 
180 185 190 

Thr Phe Ala Asn Thr He Asn Thr His Glu Gly 61y Thr His Glu Glu 
195 200 205 



Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala Lys Asp 
210 215 220 

Arg Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp Asp He 
225 230 235 240 

Arg Glu Gly Leu Ala Ala Val He Ser Val Lys Val Ser Glu Pro Gin 
245 250 255 

Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val Lys Ser 
260 . 265 270 

Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe Glu Ala 
275 280 285 

Asn Pro Ala Asp Ala Lys Thr Val Val Asn Lys Ala Val Ser Ser Ala 
290 295 300 
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Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg Arg Lys 

305 310 1. • 315 320 

Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp Cys Arg 
325 330 335 

Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly Asp Ser 
340 345 350 

Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin Ala He 
355 360 365 

Leu Pro Leu Arg Gly Lys He lie Asn Val Glu Lys Ala Arg lie Asp 
370 375 380 

Arg Val Leu Lys Asn Thr Glu Val Gin Ala lie He Thr Ala Leu Gly 
385 390 395 400 

Thr Gly lie His Asp Glu Phe Asp He Ala Arg Leu Arg Tyr His Lys 
20 405 410 415 

lie Val Leu 



<210> 27 
<211> 1263 
<212> DNA 

<213> Mycobacterium marinum 

30 <220> 

<221> CDS 

<222> (1)..(1263) 

<400> 27 

35 ggc gag aac agt ggt tac aac gtc agt ggt ggt ctg cac ggc gtg ggt 48 

Gly Glu Asn Ser Gly Tyr Asn Val Ser Gly Gly Leu His Gly Val Gly 

1 5 10 15 

gtg teg gtg gtc aac gcg ctg tec acc cga ctg gaa gtc gac ate aag 96 
40 Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Asp He Lys 

20 25 30 

cgc gac gga tac gag tgg teg cag ttc tac gac cgc gec cag ccg ggc 144 
Arg Asp Gly Tyr Glu Trp Ser Gin Phe Tyr Asp Arg Ala Gin Pro Gly 
35 40 45 



acc etc aaa cag ggc gag gca acc aag aag acc gga acc acc ate egg 192 
Thr Leu Lys Gin 61y Glu Ala Thr Lys Lys Thr Gly Thr Thr He Arg 
50 55 60 

ttc tgg gec gac teg gac ate ttt gag acc acc gaa tac gac ttc gag 240 
Phe Trp Ala Asp Ser Asp He Phe Glu Thr Thr Glu Tyr Asp Phe Glu 
65 70 75 80 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



acg gtg gcg egg 
Thr Val Ala Arg 



acc ate aac etc 
Thr lie Asn Leu 
100 

gac gtc gtc agt 
Asp Val Val Ser 
115 

gec gee gaa teg 
Ala Ala Glu Ser 
130 

tat eec ggc got 
Tyr Pro Gly Gly 
145 

agt ecg att cag 
Ser Pro He Gin 



cgc ctg cag 
Arg Leu Gin 
85 * 



ag atg gcg tte etc aac 
lu Met Ala Phe Leu Asn 
SO 



acc gac gag egg gtc acc ecg gac gag 
Thr Asp Glu Arg Val Thr Pro Asp Glu 
105 

gat acc gec gaa gca cca aag tec gec 
Asp Thr Ala Glu Ala Pro Lys Ser Ala 
120 125 

acc gcg ecg cac aag gtc aag age cgc 
Thr Ala Pro His Lys Val Lys Ser Arg 
135 140 

ttg gtc gat ttc gtc aag cac ate aac 
Leu Val Asp Phe Val Lys His He Asn 
150 155 

cag age ate gtc gac ttc sag ggc aag 
Gin Ser He Val Asp Phe Glu Gly Lys 
165 170 



aag ggc ctg 
Lys Gly Leu 
95 

gtc gtc gac 
Val Val Asp 
110 

cag gag aag 
Gin Glu Lys 



cac gaa gtc gaa 
His Glu Val Glu 
180 

gtg cac acc ttc 
Val His Thr Phe 
195 

■gaa gag ggc ttc 
Glu Glu Gly Phe 
210 

aaa gac aag aag 
Lys Asp Lys Lys 
225 

gac ate cgc gag 
Asp lie Arg Glu 



ate gcg atg cag tgg aac ggc ggc 
He Ala Met Gin Trp Asn Gly Gly 
185 



tac 

ly Gly Tyr 



gec aac acc ate aac acc cat gag ggt 
Ala Asn Thr He Asn Thr His Glu Gly 
200 205 

cgc agt gcg ttg acc teg gtg gtg aac 
Arg Ser Ala Leu Thr Ser Val Val Asn 



cgc acc aag 
Arg Thr Lys 
160 

ggc tec ggc 
Gly Ser Gly 
175 

teg gag teg 
Ser Glu Ser 
190 

gga acg cac 
Gly Thr His 



215 



220 



ctg etc aag gac aag gac ccc aac etc 
Leu Leu Lys Asp Lys Asp Pro Asn Leu 
230 235 

999 ttg gec gcg gtc ate teg gtg egg 
Gly Leu Ala Ala Val He Ser Val Arg 
245 250 



ecg cag ttc gag 
Pro Gin Phe Glu 
260 

aag teg ttt gtc 
Lys Ser Phe Val 
275 

Sag gec aat cct 
lu Ala Asn Pro 
290 



ggt cag acg aag acc aag ctg 
Gly Gin Thr Lys Thr Lys Leu 
265 



c aac 
y Asn 



cag aag gtt tgt aac gag cag etc acc 

Gin Lys Val Cys Asn Glu Gin Leu Thr 
280 285 

teg gaa gec aaa acc att gtg aac aag 

Ser Glu Ala Lys Thr He Val Asn Lys 

295 300 



gcg gta tec 
Ala Val Ser 



288 



336 



384 



acc ttc cac 432 
Thr Phe His 



480 



528 



576 



624 



aag tac gee 672 
Lys Tyr Ala 



acc ggt gac 720 
Thr Gly Asp 
240 

9tg gca gag 768 
Val Ala Glu 
255 

acc gag gtc 816 
Thr Glu Val 
270 

cac tgg ttc 864 
His Trp Phe 



912 



55 



54 



10 



15 



20 



25 



30 



45 
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teg gcg cag gca cgt etc gec gcg cgc aag gcg cga gag ttg gtg cgt 960 
Ser Ala Gin Ala Arg Leu Ala Ala Arg Lys Ala Arg Glu Leu Val Arg 
305 310* 315 320 

cgc aag age gca acc gat etc ggt 999 ctg ccc ggc aag ttg gec gac 1008 
Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp 
325 330 335 

tgc cgc teg aca gat ccg cgt aag teg gaa ctg tat gtg gtg gag ggt 1056 
Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly 
340 345 350 

gac teg gec ggc ggc teg gca aag agt ggc cgc gat teg atg ttc cag 1104 
Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin 
355 360 365 

gcg ate ctg ccg ctg cgc ggc aag ate ate aat gtc gaa aag gca cgc 1152 
Ala lie Leu Pro Leu Arg Gly Lys lie lie Asn Val Glu Lys Ala Arg 
370 375 380 



ate gac cga gtc ctg aaa aac act gaa gtc cag 9eg ate ate acc gcg 1200 
lie Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala 
385 390 395 400 

ttg ggt acc ggt att eae gac gaa ttc gac etc teg aag ctg cgc tat 124S 
Leu Gly Thr Gly He His Asp Glu Phe Asp Leu Ser Lys Leu Arg Tyr 
405 410 415 

cac aag ate gtc ttg 1263 
His Lys He Val Leu 
420 

<210> 28 

<211> 421 

<212> PRT 

35 <213> Mycobacterium marinum 

<400> 28 

Gly Glu Asn Ser Gly Tyr Asn Val Ser Gly Gly Leu His Gly Val Gly 
1 5 10 15 

40 Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu 61u Val Asp He Lys 

20 25 30 

Arg Asp Gly Tyr Glu Trp Ser Gin Phe Tyr Asp Arg Ala Gin Pro Gly 
35 40 45 



Thr Leu Lys Gin Gly Glu Ala Thr Lys Lys Thr Gly Thr Thr lie Arg 
50 55. 60 

Phe Trp Ala Asp Ser Asp lie Phe Glu Thr Thr Glu Tyr Asp Phe Glu 

65 70 75 80 

Thr Val Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu 

85 90 95 
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w 



15 



20 



25 



30 



35 



40 



45 



50 



Thr lie Asn Leu Thr Asp Glu Arg Val Thr Pro Asp 61 u Val Val Asp 
100 105 110 

Asp Val Val Ser Asp Thr Ala Glu Ala Pro Lys Ser Ala Gin Glu Lys 
115 120 125 

Ala Ala 6lu Ser Thr Ala Pro His Lys Val Lys Ser Arg Thr Phe His 
130 135 140 

Tyr Pro Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys 
145 150 155 160 

Ser Pro He Gin Gin Ser He Val Asp Phe Glu Gly Lys Gly Ser Gly 
165 170 175 

His Glu Val Glu He Ala Met Gin Trp Asn Gly Sly Tyr Ser. Glu Ser 
180 185 190 

Val His Thr Phe Ala Asn Thr lie Asn Thr His 61u Gly Gly Thr His 
195 200 205 

Glu Glu Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala 
210 215 220 

Lys Asp Lys Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp 
225 230 235 240 

Asp He Arg Glu Gly Leu Ala Ala Val He Ser Val Arg Val Ala Glu 
245 250 255 

Pro Gin Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr G1u Val 
260 265 270 

Lys Ser Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe 
275 280 285 

Glu Ala Asn Pro Ser Glu Ala Lys Thr lie Val Asn Lys Ala Val Ser 
290 295 300 

Ser Ala Gin Ala Arg Leu Ala Ala Arg Lys Ala Arg Glu Leu Val Ar 
305 310 315 32! 

Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp 
325 330 335 

Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly 
340 345 350 

Asp Ser Ala Gly Gly Ser Ala lys Ser Gly Arg Asp Ser Met Phe Gin 
355 360 365 

Ala He Leu Pro Leu Arg Gly Lys He lie Asn Val Glu Lys Ala Arg 
370 375 380 

He Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He lie Thr Ala 
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385 390 395 400 

Leu 61 y Thr Gly He His" Asp Glu Phe Asp Leu Ser Lys Leu Arg Tyr 
405 410 415 

His Lys He Val Leu 
420 



<210> 29 
<211> 1257 
<212> DMA 

<213> Mycobacterium microti 



is <220> 

<221> COS 

<222> (1)..(1257) 

<400> 29 

teg gac gcg tat gcg ata tct'ggt got etg cac ggc gtc ggc gtg teg 48 
20 Ser Asp Ala Tyr Ala He Ser Gly Gly Leu His Gly Val Gly Val Ser 

1 5 10 15 

g^tg gtt aac gcg eta tec ace' egg etc gaa gtc gag ate aag cgc gac 96 

25 



Val Asn Ala Leu Ser Thr Arg Leu Glu Val Glu He Lys Arg Asp 
20 25 30 

99 tat gag tgg tct cag gtt tat gag aag teg gaa ccc etg ggc etc 144 
ly Tyr Glu Trp Ser Gin Val Tyr Glu Lys Ser Glu Pro Leu Gly Leu 
35 40 45 

aag eaa ggg gcg ccg ace aag aag acg ggg tea acg gtg egg ttc tgg 192 
Lys Gin Gly Ala Pro Thr Lys Lys Thr Gly Ser Thr Val Arg Phe Trp 
50 55 60 

gec gac ccc get gtt ttc gaa ace acg gaa tac gac ttc gaa acc gtc 240 
as Ala Asp Pro Ala Val Phe G1u Thr Thr Glu Tyr Asp Phe Glu Thr Val 

65 70 75 80 



gee cgc egg etg eaa gag atg gcg ttc etc aac aag ggg etg acc ate 
Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu Thr He 
85 90 95 



288 



aac etg acc gac gag agg gtg acc eaa gac gag gtc gtc gac gaa gtg 336 

Asn Leu Thr Asp Glu Arg Val Thr Gin Asp Glu Val Val Asp Glu Val 
100 105 110 

45 gtc age gac gtc gec gag gcg ccg aag teg gca agt gaa cgc gca gec 384 

Val Ser Asp Val Ala Glu Ala Pro Lys Ser Ala Ser Glu Arg Ala Ala 
115 120 125 



gaa tec act gca ccg cac aaa gtt aag age cgc acc ttt cac tat ccg 432 
Glu Ser Thr Ala Pro His Lys Val Lys Ser Arg Thr Phe His Tyr Pro 
130 135 140 

ggt ggc etg gtg gac ttc gtg aaa cac ate aac cgc acc aag aac gcg 480 
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15 



20 



25 



30 



35 



40 



45 



SO 



Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys Asn Ala 
145 150 . 155 160 



att cat age age ate gtg gac ttt tec -ggc aag ggc ace 
He His Ser Ser He Val Asp Phe Ser Gly Lys Sly Thr 



165 



170 



399 cac gag 
Sly His 61 u 
175 

teg gtg cac 



gtg gag ate gcg atg caa tgg aac gee ggg tat teg gag teg gtg ca 
Val Glu He Ala Met Gin Trp Asn Ala Gly Tyr Ser Glu Ser Val Hi 
180 185 190 

acc tte gee aac ace ate aac ace cac gag ggc ggc acc cac gaa gag 
Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His Glu Glu 
195 200 .205 

ggc ttc cgc age gcg ctg acg teg gtg gtg aac aag tac gee aag gac 
Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala Lys Asp 



cgc aag eta ctg aag gac aag gac ccc aac etc acc got gac gat ate 
Arg Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp Asp He 



225 

egg gaa 



230 



ic ctg gee get gtg ate teg 
Arg Glu Gly Leu Ala Ala Val He Ser VaT Lys Val Sir Glu Pro Gin 



245 



235 

aag gtc age 



250 



240 

ccg eag 
Pro 
255 



ttc gag ggc eag acc aag acc aag ttg ggc aac acc gag gtc aaa teg 
Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val Lys Ser 
260 265 270 

ttt gtg cag aag gtc tgt aac gaa cag ctg acc cac tgg ttt gaa gee 
Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe Glu Ala 



275 



280 



285 



aac ccc acc gac teg aaa gtc gtt gtg aac aag get gtg tec teg gcg 
Asn Pro Thr Asp Ser Lys Val Val Val Asn Lys Ala Val Ser Ser Ala 
290 295 300 

caa gec cgt ate gcg gca cgt aag gca cga gag ttg gtg egg cgt aag 
Gin Ala Arg lie Ala Ala Arg Lys Ala Arg Glu Leu Val Arg Arg Lys 
305 310 315 320 

age gee acc gac ate ggt gga ttg ccc ggc aag ctg gee gat tgc cgt 
Ser Ala Thr Asp He Gly Gly Leu Pro 61y Lys Leu Ala Asp Cys Arg 
325 330 335 

tec acg gat ccg cgc aag tec gaa ctg tat gtc gta gaa ggt gac teg 
Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly Asp Ser 



340 



345 



350 



528 
576 
624 
672 
720 
768 
816 
864 
912 
960 
1008 
1056 
1104 



gec ggc ggt tct gca aaa age ggt cgc gat teg atg ttc cag gcg ata 
Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin Ala He 
355 360 365 

ctt ccg ctg cgc ggc aag ate ate aat gtg gag aaa gcg cgc ate gac 1152 
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Leu Pro Leu Arg Gly Lys He He Asn Val Glu Lys Ala Arg He Asp 
370 .375 380 

egg gts eta aag aac acc gaa gtt cag gcg ate ate acg gcg ctg age 1200 
Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala Leu Gly 
385 390 39S 400 

acc ggg ate eac gac gag ttc gat ate age aag ctg cgc tac cac aag 1248 
Thr Gly He His Asp Glu Phe Asp He Sly Lys Leu Arg Tyr His Lys 
405 410 415 

ate gtg ctg 1257 
He Val Leu 

<210> 30 

<211> 419 

<212> PRT 

<213> Mycobacterium microti 

20 <400> 30 

Ser Asp Ala Tyr Ala He Ser Sly Gly Leu His Gly Val Gly Val Ser 
15 10 15 



Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Glu He Lys Arg Asp 

20 25 30 

Gly Tyr Glu Trp Ser Gin Val Tyr Glu Lys Ser Glu Pro Leu Gly Leu 
35 40 45 

Lys Gin Gly Ala Pro Thr Lys Lys Thr Gly Ser Thr Val Arg Phe Trp 
50 55 60 

Ala Asp Pro Ala Val Phe Glu Thr Thr Glu Tyr Asp Phe Glu Thr Val 

65 70 75 80 

Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu Thr He 

85 90 95 

Asn Leu Thr Asp Glu Arg Val Thr Gin Asp Glu Val Val Asp Glu Val 

100 105 110 



<° Val Ser Asp Val Ala Glu Ala Pro Lys Ser Ala Ser Glu Arg Ala Ala 

115 120 125 

Glu Ser Thr Ala Pro His Lys Val Lys Ser Arg Thr Phe His Tyr Pro 
130 135 140 

Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys Asn Ala 
145 150 . 155 160 

He His Ser Ser He Val Asp Phe Ser Gly Lys Gly Thr Gly His Glu 
165 170 175 

Val Glu He Ala Met Gin Trp Asn Ala Gly Tyr Ser Glu Ser Val His 
180 185 190 . 
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Thr Phe 


Ala 
195 


Asn 


Thr lie 


Asn 


Thr 
200 


His 


Glu 


Gly Gly Thr His Glu 
205 


Glu 


61 y Phe 
210 


Arg 


Ser 


Ala 


Leu 


Thr 
215 


Ser 


Val 


Val 


Asn Lys Tyr Ala Lys 
220 


Asp 


Arg Lys 
225 


Leu 


Leu 


Lys 


Asp 
230 


Lys 


Asp 


Pro 


Asn 


Leu Thr Gly Asp Asp 
235 


He 
240 


Arg Glu 


Gly 


Leu 


Ala 
245 


Ala 


Val 


He 


Ser 


Val 
250 


Lys Val Ser Glu Pro 
255 


Gin 


Phe Glu 


Gly 


Gin 
250 


Thr 


Lys 


Thr 


Lys 


Leu 
265 


Gly 


Asn Thr Glu Val Lys 
270 


Ser 


Phe Val 


Gin 
275 


Lys 


Val 


Cys 


Asn 


Glu 
280 


Gin 


Leu 


Thr His Trp Phe Glu 
285 


Ala 


Asn Pro 
290 


Thr 


Asp 


Ser 


Lys 


Val 
295 


Val 


Val 


Asn 


Lys Ala Val Ser Ser 
300 


Ala 


Gin Ala 
305 


Arg 


He 


Ala 


Ala 
310 


Arg 


Lys 


Ala 


Arg 


Glu Leu Val Arg Ars 
315 


Lys 

320 


Ser Ala 


Thr 


Asp 


He 
325 


Gly 


Gly 


Leu 


Pro 


Gly 
330 


Lys Leu Ala Asp Cys 
335 


Arg 


Car Tk r- 

ser inr 


Asp 


Prr\ 

rrO 

340 


Arg 


Lys 


oer 


a IU 


Leu 
345 


iyr 


V/al lAal fcln lllu icn 

vai vaj uiu biy Asp 
350 


oer 


Ala Gly 


Sly 
355 


Ser 


Ala 


Lys 


Ser 


Gly 
350 


Arg 


Asp 


Ser Met Phe Gin Ala 
365 


He 


Leu Pro 
370 


Leu 


Arg 


Gly 


Lys 


lie 
375 


He 


Asn 


Val 


Glu Lys Ala Arg He 
380 


Asp 


Arg Val 
385 


Leu 


Lys 


Asn 


Thr 
390 


Glu 


Val 


Gin 


Ala 


He lie Thr Ala Leu 
395 


Gly 
400 


Thr Gly 


He 


His 


Asp 
405 


Glu 


Phe 


Asp 


He 


Gly 
410 


Lys Leu Arg Tyr His 
415 


Lys 



He Val Leu 



<210> 31 

<211> 1263 

<212> DNA 

<213> Mycobacterium asiaticun 

<220> 

<221> COS 

<222> CD. - (1263) 
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<400> 31 , go 

ggt gag aac age ggc tac acc gtc age §gt ggg ctg cac ggt gtc ggt 48 

Gly Glu Asn Ser Gly Tyr Thr Val Ser Gly Gly Leu His Gly Val Gly 
1 5 10 15 

gtg tea gtg gtc aac gcg ttg teg acc cga.ctc gag gtc 9ac ate aag 95 
Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Asp He Lys 
20 25 30 

cgc gac ggg cac gag tgg tec cag tat tac gag cgc gee gtt cct ggc 144 
Arg Asp Gly His Glu Trp Ser Gin Tyr Tyr Glu Arg Ala Val Pro Gly 
35 40 45 



acg etc aag cag ggc gag acg acc aag aag acc ape acc acc ate egg 
Thr Leu Lye Gin Gly Glu Ala Thr Lys Lys Thr Gly Thr Thr He Arg 
50 55 60 



192 



ttc tgg gcg gac ceg gac ate ttc gag acc acc cag tac gac ttc gag 240 

Phe Trp Ala Asp Pro Asp lie Phe Glu Thr Thr Gin Tyr Asp Phe Glu 
65 70 75 80 

acg gtg gcg cgc egg etc caa gag atg gcg ttc ctg aac aag gac ttg 288 

Thr Val Ala Arg Ara Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu 
85 90 95 

25 acc ate aac ttg aec gac gag egg gtg gac cag gac gag gtc gtc gat 336 

Thr lie Asn Leu Thr Asp Glu Arg Val Asp Gin Asp Glu Val Val Asp 
100 105 110 

gaa gtc gtc age gac acc gec gat gcg ccc aag tec gec gaa gag aag 384 

Glu Val Val Ser Asp Thr Ala Asp Ala Pro Lys Ser Ala Glu Glu Lys 

115 120 125 

gcg gec gaa tec aaa gcg ceg cac aag gtt aag cac cgc acc ttc cac 432 

Ala Ala Glu Ser Lys Ala Pro His Lys Val Lys His Arg Thr Phe His 
130 135 140 

35 

tac ccc ggc ggc ttg gtc gac ttc gtc aag cac ate aac egg acc aag 480 

Tyr Pro Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys 
145 150 155 160 

age ceg ate caa cag age gtc ate gac ttc gag ggc aaa ggc acc ggc 528 

4° Ser Pro He Gin Gin Ser Val He Asp Phe Glu Gly Lys Gly Thr Gly 

165 170 175 

cac gag gtc gag ate gcg atg cag tgg aac ggt ggc tac teg gag teg 576 

His Glu Val Glu He Ala Met Gin Trp Asn Gly Gly Tyr Ser Glu Ser 
180 185 190 

gtg cac ace ttc gec aac acg ate aac acc cae gag ggc ggt acg cac 624 

Val His Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His 

195 200 205 

gaa gaa ggg ttc cgc agt gcg ctg acg teg gtg gtg aac aaa tac gec 672 

Glu Glu Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala 
210 215 220 



55 



61 



EP 1 098 003 A2 



15 



20 



25 



30 



35 



40 



45 



50 



aaa gac aag aag ctg ctg aaa gac aag gac ccg aac etc acc got gac 
Lys Asp Lys Lys Leu Leu tys Asp Lys Asp Pro Asn Leu Thr Gly Asp 
225 230 235 240 



gac ate cgc gag gga ctg gec gcg gtg ate teg gtc aag gtc gec gaa 
jly Leu Ala Ala Val He Ser Val Lys Val Ala 61u 



Asp He Arg Glu 



245 



250 



255 



ccc cag. ttc gag ggc cag aca aag acc aag ctg gac aac acc gag gtc 
Pro Gin Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val 
260 265 270 

?* 9 £ cs £ c 8*? S? 9 ? ag 9t f t9C aac saa cag etc acc cac tgg ttc 
Lys Ser Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe 
275 280 285 

gag gee aat ccg teg gaa gee aaa acc gtt gtc aac aag gcg gtt teg 
Glu Ala Asn Pro Ser Glu Ala Lys Thr Val Val Asn Lys Ala Val Ser 
290 295 300 

tec gca cag gec egg ate gcg gcg egg aag gee cga gag ttg gtg egg 
Ser Ala Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg 
305 310 315 320 

cgc aag age gcg acc gat ttg ggc ggg ctg ccc ggc aag ctg gee gac 
Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp 



325 



330 



335 



tgc cgt tec acc gac ccg cgc aag tec gaa ctg tat gtg gtg gag ggt 
Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly 
340 345 350 

gac teg gca ggt ggc teg gec aag age ggc cgt gac teg atg ttc cag 
Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin 



gee ate ctg ccg ctg cgc ggc aag ate ate aac gtc gag aag gec cgc 
Ala He Leu Pro Leu Arg Gly Lys lie He Asn Val Glu Lys Ala Arg 
370 375 380 

ate gac egg gtc ctg aag aac acc gaa gtc eag gcg ate ate acc gcg 
He Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala 
385 390 395 400 

ctg ggt acc ggt att cac gac gag ttc gac att tct aaa ctg cgt tac 
Leu Gly Thr Gly lie His Asp Glu Phe Asp He Ser Lys Leu Arg Tyr 
405 410 415 

cac aag ate gtg ttg 
His Lys He Val Leu 
420 



720 

768 

816 

864 

912 

960 

1008 

1056 

1104 

1152 

1200 

1248 

1263 



<210> 32 
<211> 421 
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<212> PRT 

<213> Mycobacterium asiaticuo 














<400> 32 

Gly Glu Asn Ser Gig Tyr Thr Val 


Ser 


10 


Gly 


Leu 


His 


Gly Val Gly 
15 


Val Ser 


val val Asn Aia Leu oer 
20 


Thr 
25 


Arg 


Leu 


Glu 


Val 


Asp He Lys 
30 


Arg Asp 


Gly His Blu Trp 5er Gin 
35 40 


Tyr 


Tyr 


Glu 


Arg 


Ala 
45 


Val Pro Gly 


Thr Leu 
50 


lys 6ln 61 y siu Ala Tnr 
55 


Lys 


Lys 


Thr 


Gly 


Thr 


Thr lie Arg 

fill A> 1 ^0 m \ f Q 


Phe Trp 
65 


Ala Asp Pro Asp lie rne 


Glu 


Thr 


Thr 
75 


Gin 


Tyr 


Asp Phe Glu 
80 


Thr Val 


Ala Arg Arg Leu Bin biu 
85 


Met 


Ala 
90 


Phe 


Leu 


Asn 


Lys 61 y Leu 
95 


Thr lie 


Asn Leu inr Asp uiu Arg 
100 


Val 
105 


Asp 


Gin 


Asp 


Glu 


Val Val Asp 
110 


Glu val 


vai o&r asp inr Aia Asp 
115 120 


Ala 


Pro 


Lys 


Ser 


Ala 
125 


Glu Glu Lys 


Ala Ala 

130 


uiu der Lys Mia rro ms 
135 


Lys 


Val 


Lys 


His 
140 


Arg 


Thr Phe His 


Til ^ Pf-A 

145 


u i y u i y l©u va i asp rne 
150 


Val 


Lys 


His 
155 


lie 


Asn 


Arg Thr Lys 
160 


Car Prn 


Hp gin Gin Ser Val Tie 
4 1V miiv uiu <3cr vai lie 

165 


Asp 


Phe 
170 


Glu 


Gly 


Lys 


Gly Thr Gly 
175 


His Glu 


Val Glu He Ala Met Gin 
180 


a 


Asn 


Gly 


Gly 


Tyr 


Ser Glu Ser 
190 


Val His 


Thr Phe Ala Asn Thr He 
195 200 


Asn 


Thr 


His 


Glu 


Gly 
205 


Gly Thr His 


Glu Glu 
210 


Gly Phe Arg Ser Ala Leu 
215 


Thr 


Ser 


Val 


Val 
220 


Asn 


Lys Tyr Ala 


Lys Asp 
225 


Lys Lys Leu Leu Lys Asp 
230 


Lys 


Asp 


Pro 
235 


Asn 


Leu 


Thr Gly Asp 
240 


Asp lie 


Arg Glu Gly Leu Ala Ala 
245 


Val 


He 
250 


Ser 


Val 


Lys 


Val. Ala Glu 
255 


Pro Gin 


Phe Glu Gly Gin Thr Lys 
260 


Thr 
265 


Lys 


Leu 


Gly 


Asn 


Thr Glu Val 
270 


Lys Ser 


Phe Val Gin Lys Val Cys 


Asn 


Glu 


Gin 


Leu 


Thr 


His Trp Phe 
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so 
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275 280 285 

Glu Ala Asn Pro Ser 61 u Ala Lys Thr Val Val Asn Lys Ala Val Ser 
290 295 300 

Ser Ala Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val An 
305 .310 315 32i 

Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp 
325 330 335 

Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly 
340 345 350 

»s Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin 

355 360 365 

Ala He Leu Pro Leu Arg Gly Lys He He Asn Val Glu Lys Ala Arg 
370 375 380 



He Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He lie Thr Ala 
385 390 395 400 

Leu Gly Thr Gly He His Asp Glu Phe Asp He Ser Lys Leu Arg Tyr 
405 410 415 

His Lys He Val Leu 
420 



ao <210> 33 

<211> 1263 
<212> DNA 

<213> Mycobacterium scrofulceum 

<220> 
35 <221> CDS 

<222> (1). .(1263) 



<400> 33 
ggc gag ; 

40 Gly Glu Asn Ser Gly Tyr Thr Val Ser Gly Gly Leu His Gly VaT GTy 



ggc gag aac age ggc tac acc gtc age ggt ggg ttg cac gga gtg ggc 48 
" " " fly Gly Leu His Gly Val Gly 

1 5 10 15 



gtg teg gtg gtc aac gcg ctg tec acc cgc ctg gag gtc acc ate aag 96 

Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu 61u Val Thr He Lys 
u 20 25 30 

cgc gac ggg cac gag tgg ttt cag tac tac gac cgc gec gtg ccc gga 144 

Arg Asp Gly His Glu Trp Phe Gin Tyr Tyr Asp Arg Ala Val Pro 61 y 
35 40 45 

so acc etc aag cag ggc gag gee acc aag aag ace gga acc acg ate agg 192 

Thr Leu Lys Gin Gly Glu Ala Thr Lys Lys Thr Gly Thr Thr He Arg 
50 55 60 



55 



64 



EP 1 098 003 A2 



10 



15 



20 



25 



30 



ttc tgg 
Phe Trp 
65 

acc gtg 
Thr Val 



acc ate 
Thr He 



gag gtc 
Glu Val 



gec gcg 
Ala Ala 
130 

tac ccc 
Tyr Pro 
145 

age ccg 
Ser Pro 



gcg gac ccc 
Ala Asp Pro 



gcg egg egg 
Ala Arg An 
8 

aac etc acc 
Asn Leu Thr 
100 

gtc age gac 
Val Ser Asp 
115 

gaa teg act 
Slu Ser Thr 



gaa ate 

Glu He 

•70* 

etg cag 
Leu Gin 



ttc gaa acc 
Phe Glu Thr 



gac gaa 
Asp Glu 



aec gec 
Thr Ala 



?c ggt etg 
y Gly Leu 



cac gag 
His Glu 



ate cag cag 
He Gin Gin 
165 

gtc gag ate 
Val Glu lie 
180 



gcg cca 
Ala Pro 
135 

gtc gac 
Val Asp 
150 

age gtc 
Ser Val 



gec atg 
Ala Met 



gag atg gec 
Glu Met Ala 
90 

cga gtg gag 
Arg Val Glu 
105 

9ag gca ccg 
Glu Ala Pro 
120 

cac aag gtc 

His Lys Val 



aca cag tac gac ttc 
Thr Gin Tyr Asp Phe 
75 



ttc etc aac aag ggc 
Phe Leu Asn Lys Gl 



cag gac 
Gin Asp 



gag gtc gtc 
Slu Val Val 
110 



aag tec gec gaa 
Lys Ser Ala Glu 
125 



s 



sag 240 
Glu 
80 

etc 288 
Leu 



ac 336 
sp 



aag 384 

Lys 



ttc gtc aag 
Phe Va1 Lys 



ate gat ttc 
He Asp Phe 
170 

cag tgg aac 
Gin Trp Asn 
185 



aag cac egc acc ttc 
Lys His Arg Thr Phe 
140 

cac ate aac cgc acc 
His lie Asn Arg Thr 
155 

gac ggc aag ggc acc 
Asp Ely Lys Gly Thr 
175 

?c ggc .tac teg gag 
y Gly Tyr Ser Glu 
190 



cac 432 

His 



aag 480 

Lys 

160 

m 



528 



tec 576 
Ser 



35 



40 



50 



gtc cac 
Val His 



gag gag 
Glu Glu 
210 

aaa gac 
Lys Asp 
225 

gac ate 
Asp He 



acc ttc gec 
Thr Phe Ala 
195 

ac ttc cgc 
ly Phe Arg 



aag aaa etg 
Lys Lys Leu 



cca cag 
Pro Gin 



aag teg 
Lys Ser 



cgt gag age 
Arg Glu Gly 
245 

ttc gaa ggc 
Phe Glu Gly 
260 

ttc gtg cag 
Phe Val Gin 
275 



aac acc 
Asn Thr 



age gcg 
Ser Ala 
215 

etg aag 
Leu Lys 
230 

ttg gec 
Leu Ala 



ate aac acg 
He Asn Thr 
200 

etg acg teg 
Leu Thr Ser 



cac gag ggc ggc acc 
His Glu Gly Gly Thr 
205 

gtg gtg aac aag tac 
Val Val Asn Lys Tyr 
220 



gac aaa gat 
Asp Lys Asp 



ccc aac etc acc ggt 
Pro Asn Leu Thr Sly 
235 



cac 624 

His ' 



gec 672 
Ala 



gac 720 
Asp 



cag acc 
G\n Thr 



aag gtg 
Lys Val 



gcg gtc ate 
Ala Val He 
250 

aag aca aag 
Lys Thr Lys 
265 

tgc aac gag 
Cys Asn Glu 
280 



teg gt< 
Ser Va 



aag gtc gee 
Lys Val Ala 
255 



240 

gag 

Glu 



gag 768 



etg ggc aac acc gag 
Leu Gly Asn Thr Glu 
270 

cag etc acc cac tgg 
Gin Leu Thr His Trp 
285 



816 



ttc 864 
Phe 



65 
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gag gec aac cca tec gag ocg aaa acg gtg gtg aac aaa gcg gtg teg 912 
Glu Ala Asn Pro Ser Glu Ala Lys Thr VaT VaT Asn Lys Ala Val Ser 



290 295 . 300 

teg get cag gcg egc att gec gee ege aag gcg cgt gaa ctg gtg cgc 960 
Ser Ala Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg 
305 310 315 320 

cgc aag age gec aec gae etc gge ggt ctg ccc ggg aag ctg gec gac 1008 
Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp 
325 330 335 

tgc cgc tec acc gae ccg egg aaa teg gaa ctg tat gtg gtg gag gge 1056 
Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val VaT Glu Gly 
340 345 350 

gat teg gec gge gge teg gec aag age aag cgc gac teg atg ttc cag 1104 
Asp Ser Ala Gly GTy Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin 
355 360 365 

gcg ate ctg ccg ctg cgc gge aag ate ate aat gtc gag aag gec cgc 1152 
Ala lie Leu Pro Leu Arg GTy Lys lie He Asn Val Glu Lys Ala Arg 
370 375 380 

ate gac egg gtg ctg aag aac acc gaa gtt cag gcg ate ate acc gcg 1200 
He Asp Arg VaT Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala 
385 • • 390 395 400 

ctg ggt aec ggg att cac gac gag ttc gac ate acc aag ctg cgc tat 1248 
Leu Gly Thr Gly He His Asp Glu Phe Asp He Thr Lys Leu Arg Tyr 
405 410 415 

cac aag ate gtg ctg 1263 
His Lys He Val Leu 
420 

<210> 34 
<211> 421 . 
<212> PRT 

<213> Mycobacterium scrofulceum 

40 <400> 34 

Gly Glu Asn Ser Gly Tyr Thr Val Ser Gly Gly Leu His Gly Val Gly 
15 10 15 



Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Thr lie Lys 

20 25 30 

Arg Asp Gly His Glu Trp Phe Gin Tyr Tyr Asp Arg Ala Val Pro Gly. 

35 40 45 

Thr Leu Lys Gin Gly Glu Ala Thr Lys Lys Thr Gly Thr Thr He Arg 

so 50 55 60 

Phe Trp Ala Asp Pro Glu lie Phe Glu Thr Thr Gin Tyr Asp Phe Glu 



» 



66 
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50 



65- 



70 



75 



80 



Thr Val Ala Arg Arg leu Gin Glu Net Ala 
85 90 

Thr He Asn Leu Thr Asp Glu Arg Val Glu 
100 105 

Glu Val Val Ser Asp Thr Ala Glu Ala Pro 
115 120 

Ala Ala Glu Ser Thr Ala Pro His Lys Val 
130 135 

Tyr Pro Gly Gly Leu Val Asp Phe Val Lys 
145 150 

Ser Pro He Gin Gin Ser Val He Asp Phe 
165 . 170 



His Glu Val Glu He Ala Met Gin Trp Asn 
180 185 



Val His Thr Phe Ala Asn Thr He Asn Thr 
195 200 

Glu Glu Gly Phe Arg Ser Ala Leu Thr Ser 
210 215 

Lys Asp Lys Lys Leu Leu Lys Asp Lys Asp 
225 230 

Asp He Arg Glu Gly Leu Ala Ala Val He 
245 . 250 

Pro Gin Phe Glu Gly Gin Thr Lys Thr Lys 
260 265 

Lys Ser Phe Val Gin Lys Val Cys Asn Glu 
275 280 

Glu Ala Asn Pro Ser Glu Ala Lys Thr Val 
290 295 

Ser Ala Gin Ala Arg He Ala Ala Arg Lys 
305 310 

Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu 
325 330 

Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu 
340 345 



Phe Leu Asn Lys Gig Leu 

Gin Asp Glu Val Val Asp 
110 

Lys Ser Ala Glu Glu Lys 
125 

Lys His Arg Thr Phe His 
140 

His He Asn Arg Thr Lys 
155 160 

Asp Gly Lys Gly Thr Gly 
175 

Gly Gly Tyr Ser Glu Ser 
190 

His Glu Gly Gly Thr His 
205 

Val Val Asn Lys Tyr Ala 
220 

Pro Asn Leu Thr Gly Asp 
235 240 

Ser Val Lys Val Ala Glu 
255 

Leu Gly Asn Thr Glu Val 
270 

Gin Leu Thr His Trp Phe 
285 

Val Asn Lys Ala Val Ser 
300 

Ala Arg Glu Leu Val Arg 
315 320 

Pro Gly Lys Leu Ala Asp 
335 

Leu Tyr Val Val Glu Gly 
350 



Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin 
355 360 365 
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Ala He Leu Pro Leu Arg Gly Lys lie He Asn Val Glu Lys Ala Arg 
370 . 375 380 

He Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala 
385 390 395 400 

Leu Gly Thr Gly He His Asp Glu Phe Asp He Thr Lys Leu Arg Tyr 
405 410 415 



10 His Lys He Val Leu 

420 



<210> 35 
y 5 <211> 1260 



<212> DMA 
<213> Mycobacterium branderi 



<220> 
<221> COS 
» <222> (1)..(1260) 



<400> 35 

ggc gat gac age gec tac gcg gtc teg gat ggt ctg cac ggc gtg ggc 48 
Gly Asp Asp Ser Ala Tyr Ala Val Ser Gly Gly Leu His Gly Val Gly 
15 10 15 

gtg teg gtg gtc aac gca ttg teg act cga etc gag gtg gag ate geg 96 
Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Glu He Ala . 
20 25 30 

ace gac §gg tac gag tgg ttt cag cat tac gac cgc tct gtc ccc aac 144 
Thr Asp Gly Tyr Glu Trp Phe Gin His Tyr Asp Arg Ser Val Pro Gly 
35 40 45 

acg etc aag caa ggc gag aaa acc aaa aag acc gac acc acg gtc cgc 192 
Thr Leu Lys Gin Gly Glu Lys Thr Lys Lys Thr Gly Thr Thr Val Arg 
50 55 60 

ttc tgg gec gac ccg gac ate ttc gag acg acg gat tac gac ttc. gag 240 
Phe Trp Ala Asp Pro Asp lie Phe Glu Thr Thr Asp Tyr Asp Phe Glu 
65 70 75 80 



acg gtc gca cgc egg ctg cag gaa atg gcg ttc etc aac aaa ggg ctg 288 
Thr Val Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu 
85 90 95 



acc ate aac ctg acc gac gag egg gtg cga aac gaa gaa gtc gtc gac 336 

Thr He Asn Leu Thr Asp Glu Arg Val Arg Asn Glu Glu Val Val Asp 
100 .105 1.10 

gag gtc gtc age gac acc gec gac gcg ccg aag teg gcg cgc gaa gag 384 

Glu Val Val Ser Asp Thr Ala Asp Ala Pro Lys Ser Ala Arg Glu Glu 

115 120 125 

gec gaa gaa egg acc acg cag aaa gtc aag cac cgc acg ttc cat tac 432 
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25 
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50 



Ala Glu Glu 
130 

ccc ggc ggc 
Pro Gly Gly 
145 

ccc ate cat 
Pro He His 



gag gtc gag 
Glu Vai Glu 



cac acc ttc 
His Thr Phe 
195 



gaa ggg ttc 
Glu Gly 
210 



Phe 



gac cga aaa 
Asp 
225 



Arg Lys 



att cgt gag 
lie Arg Glu 



Arg Thr 



ttg gtc 
Leu Vai 



teg age 
Ser Ser 
165 

ate gca 
lie Ala 
180 

gcc aac 
Ala Asn 



cgc gcg 
Arg Ala 



ctg ctg 
Leu Leu 



ggc ctg 
Gly Leu 
245 



Thr Gin 
J35 

gat ttc 
Asp Phe 
150 

ate gtc 
He Vai 



Lys Vai Lys His Arg Thr Phe His Tyr 

UO * 



gtc aaa 
Vai Lys 



gac ttc 
Asp Phe 



cag ttc gag 
Gin Phe Glu 



teg. ttt gtg 
Ser Phe Vai 
275 

gcc aat ccc 
Ala Asn Pro 
290 

gcg cag gcc 
Ala Gin Ala 
305 

aag age gca 
Lys Ser Ala 



cgc teg ace 
Arg Ser Thr 



?! 



c cag 
y Gin 
260 



cag aag 
Gin Lys 



atg cag 
Met Gin 



acc ate 
Thr He 



gca ctg 
Ala Leu 
215 

aag gac 
Lys Asp 
230 

gcg gcg 
Ala Ala 



acc aaa 

Thr Lys 



gtc tgc 
Vai Cys 



age gac 
Ser Asp 



cgc att 
Arg He 

acc gat 
Thr Asp 
325 

gat cca 
Asp Pro 
340 



gee aag 
Ala Lys 
295 

gcc gcc 
Ala Ala 
310 



tgg aac 
Trp Asn 
185 

aac acc 
Asn Thr 
200 

acg tec 
Thr Ser 



aag gac 
Lys Asp 



gtc ate 
Vai He 



acc aaa 
Thr Lys 
265 

aac gaa 
Asn Glu 
280 

ace gtc 
Thr Vai 



cac ate 
His He 
155 

tec ggc 
Ser Gly 
170 

K S!5 



cac gag 
His Glu 



cgc aaa 
Arg Lys 



Leu 



Kit 



g ctg 
y Leu 



cgc aag 
Arg Lys 



tec gaa 
Ser Glu 
345 



ccc aac 
Pro Asn 
235 

teg gtc 
Ser Vai 
250 

etc ggc 
Leu Gly 



cag etc 
Gin Leu 



gtc aac 
Vai Asn 



gcg cga 
Ala Arg 
315 

ccg QQC 
Pro Gly 
330 

ttg tat 
Leu Tyr 



aac cgc 
Asn Arg 



tat teg 
Tyr Ser 



Gl y Gl y 
205 

aac aag 

Asn Lys 
220 

etc acc 
Leu Thr 



aca aag aac 
Thr Lys Asn 
160 



ccc ggc cac 
Gly His 



Pro 



175 

gag teg gtg 
Glu Ser Vai 
190 

acc cac gaa 
Thr His Glu 



tac gcc aag 
Tyr Ala Lys 



I? 



c gac gac 
y Asp Asp 
240 



aag gtc 
Lys Vai 



aac acc 
Asn Thr 



acc cac 
Thr His 
285 

aaa gcg 
Lys Ala 
300 . 

gaa ttg 
61 u Leu 



aag ctg 
Lys Leu 



gtg gtg 

il Vai 



age gag ccg 
Ser Glu Pro 
255 

gaa gtc aag 
Glu Vai Lys 
270 

tgg ttc gag 
Trp Phe Glu 



gtg teg teg 
Vai Ser Ser 



gtg cgc cgc 
Vai Arg An 
32 



get gac tgc 
Ala Asp Cys 
335 



gag ggt gat 
Glu Gly Asp 
350 



480 

528 

576 

624 

572 

720 

768 

816 

864 

912 

960 

1008 

1056 



Vai 

teg gcc ggc ggc teg gcc aag age ggc cgc gac teg atg ttt cag gcg 1104 
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Ser Ala Sly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin Ala 
355 360 365 

ate ctg ccg tig egg ggc aag ate ate aac gtg gag aag gee cgc ate 1152 
He Leu Pro Leu Arg Gly Lys He He Asn Val 61u Lys Ala Arg He 
370 375 380 

gac egg gtg ctg aag aac act gag gtg cag gcg ate ate ace gcg ctg 1200 
Asp Arg Val Leu Lys Asn Thr 61u Val Gin Ala He He Thr Ala Leu 
385 390 395 400 

ggc acc sag att cac gac gag ttc gac ate tec aag ctg cgc tac cac 1248 
Gly Thr Sly He His Asp Glu Phe Asp He Ser Lys Leu Arg Tyr His 
405 . 410 415 



aag ate gtg ctg 1260 
q Lys He Val Leu 

M 420 

<210> 36 
<211> 420 
<212> PRT 

<213> Mycobacterium branderi 

25 <400> 36 

Gly Asp Asp Ser Ala Tyr Ala Val Ser Gly Gly Leu His Gly Val Gly 
15 10 15 

Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Glu He Ala 

so 20 25 30 

Thr Asp Gly Tyr Glu Trp Phe Gin His Tyr Asp Arg Ser Val Pro Gly 
35 40 45 



Thr Leu Lys Gin Gly Glu Lys Thr Lys Lys Thr Gly Thr Tlir Val Arg 
50 55 60 

Phe Trp Ala Asp Pro Asp He Phe Glu Thr Thr Asp Tyr Asp Phe Glu 
65 70 75 80 

<o Thr Val Ala Arg Arg Leu Gin Glu Het Ala Phe Leu Asn Lys Gly Leu 

85 90 95 

Thr He Asn Leu Thr Asp Glu Arg Val Arg Asn Glu Glu Val Val Asp 
100 105 110 



Glu Val Val Ser Asp Thr Ala Asp Ala Pro Lys Ser Ala Arg Glu Glu 
115 120 125 

Ala Glu Glu Arg Thr Thr Gin Lys Val' Lys His Arg Thr Phe His Tyr 
130 135 140 

Pro Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys Asn 

145 150 155 160 
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Pro He His Ser Ser lie Val Asp Phe Ser Gly Lys Gly Pro Gly His 
165 . . 170 175 

Glu Val Glu He Ala Met Gin Trp Asn Ala Gly Tyr Ser Glu Ser Val 
180 185 190 

His Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His Glu 
195 200 205 

Glu Gly Phe Arg Ala Ala Leu Thr Ser Val Val Asn Lys Tyr Ala Lys 
210 215 220 

Asp Arg Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp Asp 
225 230 235 240 

He Arg Glu Gly Leu Ala Ala Val He Ser Val Lys Val Ser Glu Pro 
245 250 255 

Gin Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val Lys 
260 265 270 

Ser Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe Glu 
275 280 285 

Ala Asn Pro Ser A6p Ala Lys Thr Val Va1 Asn Lys Ala Val Ser Ser 
290 295 300 

Ala Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg Arg 
305 310 315 320 

Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp Cys 
325 330 335 

Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly Asp 
340 345 350 

Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin Ala 
355 360 365 

He Leu Pro Leu Arg Gly Lys He lie Asn Val Glu Lys Ala Arg He 
370 375 380 

Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala Leu 
385 390 395 ' 400 

Gly Thr Gly He His Asp Glu Phe Asp He Ser Lys Leu Arg Tyr His 
405 410 415 

Lys lie Val Leu 
420 



<210> 37 
<211> 1263 
<212> DNA 

<213> Mycobacterium paratuberculosis 



71 



EP 1 098 003 A2 



10 



15 



25 



30 



35 



40 



45 



50 



<220> 

<221> COS 

<222> (1)..(t263) 

<400> 37 

ggc gag aac age ggc 
Gly 61 u Asn Ser GT: 



tac aac 
Tyr Asn 



gtc age 
Val Ser 



agt ctg cac ggc 
Gly Leu His Gly 



gtc teg gtg gtc aac 
Val Ser Val Val Asn 
20 

cgc gac ggc tac gag 
Arg Asp Gly Tyr 61 u 
35 



acc etc aag cag go 
Thr Leu Lys Gin Gl; 
50 



ttc tgg gcc gac ccc 
Phe Trp Ala Asp Pro 
65 

acg gtg gcc egg egg 
Thr Val Ala Arg Arg 
85 

acc ate aac etc acc 
Thr He Asn Leu Thr 
100 

gag gtg gtc age gac 
Glu Val Val Ser Asp 
115 

gcg gcg gaa teg get 
Ala Ala Glu Ser Ala 
130 

tac ccc ggc ggc ctg 
Tyr Pro Gly Gly Leu 
145 

aac ccc ate cac cag 
Asn Pro He His Gin 
165 

cac gag gtc gag ate 
His filu Val Glu He 
180 

gtg cac acc ttc gcc 
Val His Thr Phe Ala 



gcg ctg 
Ala Leu 



tgg teg 
Trp Ser 



lag gcc 
lu Ala 
55 

gac ate 
Asp lie 
70 

ctg cag 
Leu 61 n 



tec act egg 
Ser Thr Arg 
25 

cag tac tac 
Gin Tyr Tyr 
40 

ace aag cgc 
Thr Lys Arg 



etc gag g 
Leu Glu V 



tc aac 
al Asn 
30 



gtc ggc 48 
Val Gly 
15 

ate gcc 96 
He Ala 



Jac cac gcc gtg 
sp His Ala Val Pro Gly 



Thr 



ttc gag acc 
Phe Glu Thr 



gac gag 
Asp Glu 



acc gcc 
Thr Ala 



gcg ccg 
Ala Pro 
135 

gtc gac 
Val Asp 
150 

age ate 
Ser He 



gaa atg geg 
Glu Net Ala 
90 

egg gtg acc 
Arg Val Thr 
105 

gac gca ccc 
Asp Ala Pro 
120 

cat aag gtc 
His Lys Val 



ttc gtc aaa 
Phe Val Lys 



acc gag 
Thr Glu 
75 

ttc etc 
Phe Leu 



45 

acc acc 
Thr Thr 



tac gac 
Tyr Asp 



aac aag 
Asn Lys 



ccc ggc 144 

il> 



ate egg 192 
He Arg 



ttc gaa 240 
Phe Glu 
80 



g^c ctg 288 



Leu 



gcg atg 
Ala Met 



aac acc 
Asn Thr 



ate gat ttc 
lie Asp Phe 
170 

cag tgg aac 
Gin Trp Asn 
185 

ate aac acg 
He Asn Thr 



aac gaa gag gtc 
Asn Glu Glu Val 
110 

aag teg gcg cag 
Lys Ser Ala Gin 

125 . 

aag cac cgc acc 

Lys His Arg Thr 
140 

cac ate aat cgc 

His He Asn Arg 
155 

ft ggg aag ggc 

y Gly Lys Gly 



ggc ggc tac tec 
Gly Gly Tyr Ser 
190 

cac gag ggc ggc 

His Glu Gly Gly 



gtc gac 336 
Val Asp 



gag aag 384 
Glu Lys 



ttc cac 432 
Phe His 



acc aaa 480 
Thr Lys 
160 

ccc ggc 528 
Pro Gly 
175 

gaa teg 576 
Glu Ser 



acc cac 524 
Thr His 



55 
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205 



10 



15 



20 



25 
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35 
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45 



SO 



gag gag ggc ttc cgc age gcg ctg acc tec gtg gtc aac aag tac gee 

Glu 61 u Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala 
210 215 220 

aag gac aag aag ctg etc aag gac aag gac ccc aac ctg acc ggt gac 

Lys Asp Lys Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp 



225 



230 



235 



240 



gac ate cgc gag ggt ttg gec gcg gtg ate teg gtc aag gtg age gaa 
Asp He Arg Glu Gly Leu Ala Ala Val He Ser Val Lys Val Ser Glu 
245 250 255 



ccg cag ttc gag ggc eag ace aag acc aaa ctg 
Pro 61 n Phe Glu Gly Gin Thr Lys Thr Lys Leu 
260 265 



fc aac aec gag gt 
y Asn Thr Glu Va 
270 



aag teg ttc gtg cag aag gtg tgc aac gaa eag etc acc cac tgg.ttc 
Lys Ser Phe Val Gin Lys Val Cys Asn Glu 61 n leu Thr His Trp Phe 



275 



280 



285 



gaa gee aac ccc gca gac gee aaa gtc att gtc aac aag gcg gtt tea 
Glu Ala Asn Pro Ala Asp Ala Lys Val He. Val Asn Lys Ala Val Ser 
290 295 300 



tea gcg cag 
Ser Ala Gin 
305 



eg cgc ate gec gcg cgc aag gcg cga 
la Arg He Ala Ala Arg Lys Ala Arg 
310 315 



jag ttg gtg cgc 
>1u Leu Val Arg 
320 



cgc aag age gca acc gac ctg ggc ggg ctg ccc ggc aag etc gee gac 
Arg Lys Ser Ala Thr Asp Leu Gly Gly Leu Pro .Gly Lys Leu Ala Asp 
325 330 335 

tgc egg teg acc gac ccg cgc aag teg gaa ttg tat gtg gtc gag ggt 
Cys Arg Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly 
340 345 350 

gac teg gee ggc ggc teg gcg aaa age ggc egg gae teg atg ttc cag 
Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin 
355 360 365 

gee ate ctt ccg ctg cgc ggc aag ate ate aac gtc gaa aag gec cgc 
Ala He Leu Pro Leu Arg 61y Lys He He Asn Val Glu Lys Ala Arg 
370 375 380 

ate gac egg gtg ctg aag aac acc gag gtg cag gcg ate ate acc gcg 
He Asp Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala 
385 390 395 400 

ctg ggc acc ggg att cac gac gag ttc gac ate acc aag ctg cgc tac 
Leu Gly Thr Gly He His Asp Glu Phe Asp He Thr Lys Leu Arg Tyr 



405 

cac aag ate gtg ttg 
His Lys He Val Leu 



410 



415 



672 

720 

768 

816 

864 

912 

960 

1008 

1056 

1104 

1152 

1200 

1248 

1263 



55 
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420 



<210> 38 
<211> 421 
<212> PRT 

<213> Mycobacterium paratuberculosis 
<400> 38 

Gly Glu Asn Ser Gly Tyr Asn Val Ser Gly Gly Leu His Gly Val Gly 
15 10 15 

Val Ser Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Asn He Ala 
20 25 30 

Arg Asp Gly Tyr Glu Trp Ser Gin Tyr Tyr Asp His Ala Val Pro Gly 
35 40 45 

Thr Leu Lys Gin Gly Glu Ala Thr Lys Arg Thr Gly Thr Thr He Arg 
50 55 60 

Phe Trp Ala Asp Pro Asp He Phe Glu Thr Thr Glu Tyr Asp Phe Glu 
65 70 75 80 

Thr Val Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu 
85 90 95 

Thr He Asn Leu Thr Asp Glu Arg Val Thr Asn Glu Glu Val Val Asp 
100 105 110 

Glu Val Val Ser Asp Thr Ala Asp Ala Pro Lys Ser Ala Gin Glu Lys 
115 120 125 

Ala Ala Glu Ser Ala Ala Pro His Lys Va1 Lys His Arg Thr Phe His 
130 135 140 

Tyr Pro Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys 
145 150 155 160 

Asn Pro He His Gin Ser He He Asp Phe Gly Gly Lys Gly Pro Gly 
165 170 175 

His Glu Val Glu He Ala Met Gin Trp Asn Gly Gly Tyr Ser Glu Ser 
180 185 190 

Val His Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His 
195 200 205 

Glu Glu Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala 
210 215 220 • 

Lys Asp Lys Lys Leu Leu Lys Asp Lys Asp Pro Asn Leu Thr Gly Asp 
225 230 235 240 

Asp He Arg Glu Gly Leu Ale Ala Val He Ser Val Lys Val Ser Glu 



245 



250 



255 
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Pro 


Gin 


Phe 


61 u 
260 


Gly Gin Thr Lys Thr 
265 


Lys 


Leu Gly Asn 


Thr Glu 
270 


Val 


Lys 




275 


Vol 
Va 1 


u j n Lys va i wjs /ion 

280 


u 1 U 


uin Leu inr 
285 


His Ira 


Phe 


Glu 


Ala 
290 


Asn 


Pre 


Ala Asp Ala Lys Val 
295 


He 


Val Asn Lys 
300 


Ala Val 


Ser 


Ser 
305 


Ala 


Gin 


Ala 


Arg lie Ala Ala Arg 
310 


Lys 


Ala Arg Glu 
315 


Leu Val 


Arg 

320 


Arg 


Lys 


Ser 


Ala 


Thr Asp Leu Gly Gly 
325 


Leu 
330 


Pro Gly Lys 


Leu Ala 
335 


Asp 


Cys 


Arg 


Ser 


Thr 
340 


Asp Pro Arg Lys Ser 
345 


Glu 


Leu Tyr Val 


Val Glu 
350 


Gly 


Ann 


Scar 


Ala 

355 


61 v 


u i j uci Ala Ljo 

360 


ujy 


Aro Acn ^pr 

365 


Mot Php 


Gin 


Ala 


He 
370 


Leu 


Pro 


Leu Arg Gly Lys lie 
375 


He 


Asn Val Glu 
380 


Lys Ala 


Arg 


He 
385 


Asp 


Arg 


Val 


Leu Lys Asn Thr Glu 
390 


Val 


Gin Ala He 
395 


He Thr 


Ala 
400 


Leu 


Gly 


Thr 


Gly 


He His Asp Glu Phe 
405 


Asp 
410 


lie Thr Lys 


Leu Arg 
415 


Tyr 


His 


Lys 


He 


Val 
420 


Leu 











<210> 39 
<211> 1257 
<212> DNA 

<213> Mycobacterium kansasii 

<220> 

<221> CDS 

<222> (1)..(1257) 

<400> 39 

tec gac gec tac gcg ata teg ggc ggg ctg cac ggt gtg ggt gtc teg 48 
Ser Asp Ala Tyr Ala He Ser Gly Gly Leu His Gly Val Gly Val Ser 
15 10 15 

gtg gtc aac gca ctg tec acc egg ctg gag gtg gag ate aag cgc gac 96 
Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Glu He Lys Arg Asp 
• 20 25 30 

ggc cat gag tgg teg cag gtt tac gag aaa tec gag ccg atg gga etc 144 
Gly His Glu Trp Ser Gin Val Tyr Glu Lys Ser Glu Pro Met Gly Leu 
35 40 45 
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10 



15 



SO 



30 



40 



SO 



aag caa gge gcg cog act aag aag acc ggc .acg acg gtg egg ttc tgg 
Lys Gin Gly Ala Pro Thr Lys Lys Thr Gly Thr Thr Val Arg Phe Trp 



50 



gec gat ccc aat gtt ttt gag acc acc gag tac gac ttc gaa acc gtc 
Ala Asp Pro Asn Val Phe Glu Thr Thr 61u T^r Asp Phe Glu Thr Val 



65 



70 



80 



gca cga egg ttg cag gag atg gcg ttt etc aac aag ggg etc acc ate 
Ala Arg Arg leu Gin Glu Het Ala Phe Leu Asn Lys Gly Leu Thr He 
85 90 95 

aat ctg acc gat cag egg gtg acc cag gac gag gtc gtc gac gag gtg 
Asn Leu Thr Asp Gin Arg Val Thr Gin Asp Glu Val Val Asp Glu Val 
100 105 110 

gtc age gac gtc gee gag gec cca aag teg gee age gag aag gcg gee 
Val Ser Asp Val Ala Glu Ala Pro Lys Ser Ala Ser Glu Lys Ala Ala 
115 120 125 

gaa tec gec gec ccg cac aag gtc aag aag cgt acc ttc cac tat ccc 
Glu Ser Ala Ala Pro His Lys Val Lys Lys Arg Thr Phe His Tyr Pro 
130 135 140 

ggg ggt ctg gtt gac ttc gtc aag cac ate aac egg acc aag aac gee 
Gly Gly Leu Val Asp Phe Val Lys His He Asn Arg Thr Lys Asn Ala 



145 



150 



155 



ate cac age age ate gtc gac ttc tec ggt aag gga ccc gg 
He His Ser Ser He Val Asp Phe Ser Gly Lys Gly Pro Gl; 



165 



170 



160 

c cac gaa 
y His Glu 
175 



gtg gag ate gcg atg cag tgg aat gee ggc tat teg gag teg gtg ca 
Val Glu He Ala Met Gin Trp Asn Ala Gly Tyr Ser Glu Ser Val Hi 
180 185 190 



cat 
s 



acc ttc gec aac acc ate aac acc cac gag ggt ggg acc cac 
Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His Glu Glu 



195 



200 



205 



ggg ttc cgc age gcg ctg acc teg gtg gtg aac aag tac gee aag gac 
Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala Lys Asp 
210 215 220 

cgc aaa ctg etc aag gaa aag gac ccc aac etc acc ggc gac gac ate 
Arg Lys Leu Leu Lys Glu Lys Asp Pro Asn Leu Thr Gly Asp Asp He 
225 230 235 240 

egg gaa ggg ttg gee gcg gtg att teg gtc aag gtc age gag ccg cag 
Arg Glu Gly Leu Ala Ala Val He Ser Val Lys Val Ser Glu Pro Gin 



245 



250 



255 



ttc gag ggc cag acc aag acg aaa ctg ggc aac acc gag gtg aag teg 
Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val Lys Ser 
260 265 270. 



192 
240 
288 
336 
384 
432 
480 
528 
576 



lag 624 



672 



720 



768 



816 



55 
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ttc gtg cag aag gtg tgc aac gaa cag etc. acc cat tgg ttc gag gcc 864 
Phe VaT Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe Glu Ala 
275 280 285 

aac ccc get gac get aaa acc gtt gtc aac aag gcg gtt tea teg gcg 912 
Asn Pro Ala Asp Ala Lys Thr Val Val Asn Lys Ala Val Ser Ser Ala 
290 295 300 

caa gca cgc att gcg gcc cgc aag gcg cgc gag ttg gtg cgc cgc aag 960 
Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg Arg Lys 
305 310 315 320 

age gca acc gat ctg age gga eta ccc ggc aag etc gcc gac tgc cgc 1008 
Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp Cys Arg 
325 330 335 



teg acc gac ccg cgc aag tec gaa ctg tat gtcj gtg gag ggt gat tea 1056 
Ser Thr Asp Pro Arg. Lys Ser Glu Leu Tyr Val Val Glu Gly Asp Ser 
340 345 350 



gcc ggc ggc teg gcg aag age ggt cgc gac teg atg ttc cag gcc ate 
Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp Ser Met Phe Gin Ala He 
355 360 365 



Gly His Glu Trp Ser Gin Val Tyr Glu Lys Ser Glu Pro Met Gly Leu 
35 40 45 



1104 



2S ttg ccg ttg cgc ggc aag ate ate aac gtc gag aag gcc cgc ate gac 1152 

Leu Pro Leu Arg Gly Lys He He Asn Val Glu Lys Ala Arg He Asp 
370 375 380 

egg gtg ctg aag aac acc gaa gtc cag gcg ate ate ace gcg ttg ggt 1200 
Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala Leu Gly 
385 390 395 400 

acc ggc ate cac gac gaa ttc gac ate gcg aga ctg cgt tac cac aag 1248 
Thr Gly He His Asp Glu Phe Asp He Ala Arg Leu Arg Tyr His Lys 
405 410 415 

ate gtg etc 1257 
He Val Leu 

<210> 40 
40 <211> 419 

<212> PRT 

<213> Hycobacteri urn kansasii 
<400> 40 

45 Ser Asp Ala Tyr Ala He Ser Gly Gly Leu His Gly Val Gly Val Ser 

1 b 5 10 15 

Val Val Asn Ala Leu Ser Thr Arg Leu Glu Val Glu He Lys Arg Asp 
20 25 30 
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Lys Gin Gly Ala Pro Thr Lys Lys Thr Gly Thr Thr Val Arg Phe Trp 
50 . 55 .60 

Ala Asp Pro Asn Val Phe Glu Thr Thr Glu Tyr' Asp Phe Glu Thr Val 
65 70 75 80 

Ala Arg Arg Leu Gin Glu Met Ala Phe Leu Asn Lys Gly Leu Thr He 
85 90 95 

Asn Leu Thr Asp Gin Arg Val Thr Gin Asp Glu Val Val Asp Glu Val 
100 105 110 

Val Ser Asp Val Ala Glu Ala Pro Lys Ser Ala Ser Glu Lys Ala Ala 
115 120 125 

Glu Ser Ala Ala Pro His Lye Val Lys Lys Arg Thr Phe His Tyr Pro 
130 135 140 

Gly Gly Leu Val Asp Phe -Val Lys His He Asn Arg Thr Lys Asn Ala 
145 150 155 160 

He His Ser Ser lie Val Asp Phe Ser Gly Lys 61 y Pro Gly His Glu 
165 170 175 

Val Glu He Ala Met Gin Trp Asn Ala Gly Tyr Ser Glu Ser Val His 
180 185 190 

Thr Phe Ala Asn Thr He Asn Thr His Glu Gly Gly Thr His Glu Glu 
195 200 205 

Gly Phe Arg Ser Ala Leu Thr Ser Val Val Asn Lys Tyr Ala Lys Asp 
210 215 220. 

Arg Lys Leu Leu Lys Glu Lys Asp Pro Asn Leu Thr Gly Asp Asp He 
225 230 235 240 

Arg Glu Gly Leu Ala Ala Val He Ser Val Lys Val Ser Glu Pro Gin 
245 250 255 

Phe Glu Gly Gin Thr Lys Thr Lys Leu Gly Asn Thr Glu Val Lys Ser 
260 265 270 

Phe Val Gin Lys Val Cys Asn Glu Gin Leu Thr His Trp Phe Glu Ala 
275 280 285 

Asn Pro Ala Asp Ala Lys Thr Val Val Asn Lys Ala Val Ser Ser Ala 
290 295 300 

Gin Ala Arg He Ala Ala Arg Lys Ala Arg Glu Leu Val Arg Arg Lys 
305 310 . 315 320 

Ser Ala Thr Asp Leu Gly Gly Leu Pro Gly Lys Leu Ala Asp Cys Arg 
325 330 335 

Ser Thr Asp Pro Arg Lys Ser Glu Leu Tyr Val Val Glu Gly Asp Ser 
340 345 350 
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Ala Gly Gly Ser Ala Lys Ser Gly Arg Asp. Ser Met Phe Gin Ala He 

355 "-=360 365 

Leu Pro Leu Arg Gly Lys He lie Asn Val Glu Lys Ala Arg He Asp 
370 375 380 

Arg Val Leu Lys Asn Thr Glu Val Gin Ala He He Thr Ala Leu Gly 
385 390 39S 400 

Thr Gly lie His Asp Glu Phe Asp He Ala Arg Leu Arg Tyr His Lys 
405 410 415 

He Val Leu 



<210> 41 
<211> 20 
<212> ONA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 41 

tcggacgcgt atgcgatatc 



<210> 42 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 42 

Ser Asp Ala Tyr Ala He Ser 

1 5 



<210> 43 
<211> 20 
<212> DNA 

<213> Artificial Sequence 1 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 43 

acatacagtt cggacttgcg 



<210> 44 
<211> 7 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<Z23> Description of Artificial Sequence: Amino acid 
sequence corresponding to primer or probe 

<400> 44 

Arg Lys Ser 61 u Leu Tyr Val 



<210> 45 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 45 

gaagacgggg tcaacggtg 



<210> 46 
<211> S 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Amino acid 
sequence corresponding to primer or probe 

<400> 46 

Lys Thr Gly Ser Thr Val 
1 5 



<210> 47 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 47 

ccttgttcac aacgactttc gc 



<210> 48 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: Amino acid 
sequence corresponding to primer or probe 

<400> 48 

Ala Lys Val Val Val Asn Lys 
1 5 



<210> 49 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 49 

gaagacgggg tcaacggta 



<210> 50 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Amino acid 
sequence corresponding to primer or probe 

<400> 50 

Lys Thr Gly Ser Thr Val 
1 5 



<210> 51 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:. Synthetic DNA 
<400> 51 

ccagtgggtc agctgttca 



<210> 52 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Amino acid 
sequence corresponding to priaier or probe 

<400> 52 



81 



# 



EP 1 098 003 A2 



Glu Gin Leu Thr His Trp 
1 5 . 



<210> 53 
<211> 22 
<212> ONA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DMA 
<400> 53 

ccttgtteac aacgactttc ga 



<210> 54 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Amino acid 
sequence corresponding to primer or probe . 

<400> 54 

Ser Lys Va1 Val Val Asn Lys 
1 5 



<210> 55 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 55 

agatcaagcg cgacgggtat 



<210> 56 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<22D> 

<223> Description of Artificial Sequence: Amino acid 
sequence corresponding to primer or probe 

<400> 56 

He Lys Arg Asp GlyTyr 
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<2t0> 57 
<211> 19 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 57 

ccagtgggtc agctgttcg 



<210> 58 
<211> 6 
is <212> PRT 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Amino acid 
sequence corresponding to primer or probe 

<400> 58 

Slu Gin Leu Thr His Trp 
1 5 



<210> 59 

<211> 38 

<212> DNA 

<213> Artificial Sequence 

30 <220> 

<223> Description of Artificial Sequence: Synthetic DNA 

<400> 59 

tgtaaaacga cggccagtca ygcnggnggn aarttyga 

35 

<210> 60 

<211> 36 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 60 

ctgcgttcgt atatgagcnc crtcnacrtc ngcrtc 



<210> 61 
<211> 17 
<212> DNA 

50 <213> Artificial Sequence 

<220> 



55 
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<223> Description of Artificial Sequence: Synthetic DNA 
<400> 61 

gacgcstayg cgatatc 17 

<210> 62 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<400> 62 

agcg9ytaca acgtcag 17 



Claims 

1 . A method for identifying a slow growing mycobacteria species, which comprises amplifying the regions correspond- 
ing to SEQUENCE NO. 7, 9, 1 1 , 1 3, 1 5, 1 7, 1 9, 21 , 23, 25, 27, 29, 31 , 33, 35, 37 and 39 in the DNA which encodes 
DNA gyrase p subunit of a slow growing mycobacteria in a sample, determining and comparing the nucleotide 
sequence of the amplified fragment with the nucleotide sequences described in SEQUENCE NO. 7, 9, 11, 13, 15, 
17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37 and 39, thereby calculating a genetic distance of the sequence of the 
amplified fragment from each sequence, and identifying the species of the slow growing mycobacteria in a sample 
based on the genetic distance. 

2. A method for detecting Mycobacterium kansasii, which comprises detecting Mycobacterium kansasli using, as a 
primer or probe, an oligonucleotide which comprises a sequence coding for a part of or the entire portion of the 
amino acid sequence described in SEQUENCE NO. 4, or a complementary sequence thereof, and substantially 
functions as a primer or probe. 

3. A Mycobacterium kansasif detection kit which comprises an oligonucleotide which comprises a sequence coding 
for a part of or the entire portion of the amino acid sequence described in SEQUENCE NO. 4, or a complementary 
sequence thereof, and substantially functions as a primer or probe. 



4. A method for detecting Mycobacterium gastri, which comprises detecting Mycobacterium gastri using, as a primer 
or probe, an oligonucleotide which comprises a sequence coding for a part of or the entire portion of the amino acid 
sequence described in SEQUENCE NO. 6, or a complementary sequence thereof, and substantially functions as 
a primer or probe. 

5. A Mycobacterium gastri detection kit which comprises an oligonucleotide which comprises a sequence coding for 
a part of or the entire portion of the amino acid sequence described in SEQUENCE NO. 6, or its complementary 
sequence, and substantially functions as a primer or probe. 

6. A method for identifying a slow growing mycobacteria species, which comprises 

checking, in DNA of a sample, the existence of a DNA comprising a unique region having a different nucleotide 
sequence In the DNA sequence coding for DNA gyrase p subunit among slow growing mycobacteria, 
identifying a bacterium in a sample based on the existence of said DNA as a marker. 

7. The method for identifying a slow growing mycobacteria species according to claim 6, wherein the slow growing 
mycobacteria are Mycobacterium slmiae, Mycobacterium bovis, Mycobacterium szulgai, Mycobacterium mal- 
moense, Mycobacterium intracellulare, Mycobacterium avium, Mycobacterium gondonae, Mycobacterium africa- 
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num. Mycobacterium tuberculosis, Mycobacterium gastrt, Mycobacterium marinum t Mycobacterium microti, 
Mycobacterium asiaticum, Mycobacterium scrofulaceum, Mycobacterium branderi, Mycobacterium 
paratuberculosls. and Mycobacterium kansasii. 

5 8. The method for identifying a slow growing mycobacteria species according to claim 6, which comprises the follow- 
ing steps (1)to(4): 

(1 ) synthesizing an oligonucleotide which comprises a unique region having a different nucleotide sequence in 
the DNA sequence coding for DNA gyrase p subunit among slow growing mycobacteria, 
io (2) preparing a solution which comprises the oligonucleotide synthesized in the step (1), dNTP, DNA polymer- 

ase and a bacterial DNA In a sample, 

(3) heating the solution prepared in the step (2) repeatedly under such conditions that polymerase chain reac- 
tion can occur, and 

(4) subjecting the solution obtained in the step (3) to electrophoresis to identify the species of the bacterium in 
15 a sample based on the electrophoresis pattern. 

9. The method for identifying a slow growing mycobacteria species according to ciaim 8 f wherein the oligonucleotide 
is an oligonucleotide which encodes the amino acid sequence described in SEQUENCE NO. 46, SEQUENCE NO. 
48, SEQUENCE NO. 50, SEQUENCE NO. 52, SEQUENCE NO. 54, SEQUENCE NO. 56 or SEQUENCE NO. 58. 

20 

10. The method for identifying a slow growing mycobacteria species according to claim 8, wherein the oligonucleotide 
is an oligonucleotide represented by SEQUENCE NO. 45, SEQUENCE NO. 47, SEQUENCE NO. 49, SEQUENCE 
NO. 51, SEQUENCE NO. 53, SEQUENCE NO. 55 or SEQUENCE NO. 57. 

25 11. The method for identifying a slow growing mycobacteria species according to claim 6, which comprises the follow- 
ing steps (1)to(4): 

(1 ) synthesizing a first oligonucleotide which is identical to a first partial sequence in the DNA sequence coding 
for the DNA gyrase p subunit of slow growing mycobacteria and a second oligonucleotide which is complemen- 

30 tary to a second partial sequence in the DNA sequence coding for the DNA gyrase p subunit of slow growing 

mycobacteria, said first partial sequence and said second partial sequence being respectively conserved 
among slow growing mycobacteria, 

(2) subjecting the two oligonucleotides synthesized in the step (1) as primers and a bacterial DNA sample as 
a template to the polymerase chain reaction, 

35 (3) mixing the DNA fragment amplified in the step (2) with a restriction enzyme under the conditions at which 

the restriction enzyme is active, said restriction enzyme recognizing the sequence unique to one or more slow 
growing mycobacteria, and 

(4) subjecting the mixture obtained in the step (3) to electrophoresis to identify the species of the bacterium in 
a sample based on the electrophoresis pattern. 

40 

12. The method for identifying a slow growing mycobacteria species according to claim 1 1 , wherein the two oligonucle- 
otides to be used as primers are oligonucleotides represented by SEQUENCE NO. 1 and SEQUENCE NO. 3, and 
the restriction enzymes to be used are Rsa I and Taq I. 

45 13. An identification kit for a slow growing mycobacteria species, which comprises an oligonucleotide containing a 
region of DNA coding for DNA gyrase p subunit, a region having different nucleotide sequence among slow growing 
mycobacteria. 

14. An identification kit for a slow growing mycobacteria species, which comprises 

so 

a first oligonucleotide which is identical to a first partial sequence in the DNA sequence coding for the DNA 
gyrase p subunit of slow growing mycobacteria, 

a second oligonucleotide which Is complementary to a second partial sequence In the DNA sequence coding 
for the DNA gyrase p subunit of slow growing mycobacteria, said first partial sequence and said second partial 
55 sequence being respectively conserved among slow growing mycobacteria, and 

one or more restriction enzyme recognizing the sequence unique to one or more slow growing mycobacteria. 

15. A method for detecting a slow growing mycobacteria species, which comprises 
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checking, in DNA of a sample, the existence of a DNA comprising a unique region having a different nucleotide 
sequence in the DNA sequence coding for DNA gyrase fj subunit among slow growing mycobacteria, 
detecting a bacterium in a sample based on the existence of said DNA as a marker. 

5 16. The method for detecting a slow growing mycobacteria species according to claim 1 5, wherein the slow growing 
mycobacteria are Mycobacterium s/m/ae, Mycobacterium bovis, Mycobacterium szulgaf, Mycobacterium mal- 
moense, Mycobacterium intraceiiulare, Mycobacterium avium, Mycobacterium gordonae, Mycobacterium africa- 
num, Mycobacterium tuberculosis, Mycobacterium gastri, Mycobacterium marinum, Mycobacterium microti, 
Mycobacterium asiaticum, Mycobacterium scrofulaceum, Mycobacterium branderi, Mycobacterium 

w paratuberculosis, and Mycobacterium kansasIL 

1 7. The method for detecting a slow growing mycobacteria species according to claim 1 5, which comprises the follow- 
ing steps (1) to (4): 

is (1 ) synthesizing an oligonucleotide which comprises a unique region having a different nucleotide sequence in 

the DNA sequence coding for DNA gyrase P subunit among slow growing mycobacteria, 

(2) preparing a solution which comprises the oligonucleotide synthesized In the step (1), dNTR DNA polymer- 
ase and a bacterial DNA in a sample, 

(3) heating the solution prepared in the step (2) repeatedly under such conditions that polymerase chain reac- 
20 tion can occur, and 

(4) subjecting the solution obtained in the step (3) to electrophoresis to detecting the bacterium in a sample 
based on the electrophoresis pattern. 

1 8. The method for detecting a slow growing mycobacteria species according to claim 1 7, wherein the oligonucleotide 
25 is an oligonucleotide which encodes the amino acid sequence described in SEQUENCE NO. 46, SEQUENCE NO. 

48, SEQUENCE NO. 50, SEQUENCE NO. 52, SEQUENCE NO. 54, SEQUENCE NO. 56 or SEQUENCE NO. 58. 

19. The method for detecting a slow growing mycobacteria species according to claim 17, wherein the oligonucleotide 
is an oligonucleotide represented by SEQUENCE NO. 45, SEQUENCE NO. 47, SEQUENCE NO. 49, SEQUENCE 

30 NO. 51, SEQUENCE NO. 53, SEQUENCE NO. 55 or SEQUENCE NO. 57. 

20. The method for detecting a slow growing mycobacteria species according to claim 15, which comprises the follow- 
ing steps (1) to (4): 

(1 ) synthesizing a first oligonucleotide which is identical to a first partial sequence in the DNA sequence coding 
for the DNA gyrase 0 subunit of slow growing mycobacteria and a second oligonucleotide which is complemen- 
tary to a second partial sequence in the DNA sequence coding for the DNA gyrase subunit of slow growing 
mycobacteria, said first partial sequence and said second partial sequence being respectively conserved 
among slow growing mycobacteria, 

(2) subjecting the two oligonucleotides synthesized in the step (1) as primers and a bacterial DNA sample as 
a template to the polymerase chain reaction, 

(3) mixing the DNA fragment amplified in the step (2) with a restriction enzyme under the conditions at which 
the restriction enzyme is active, said restriction enzyme recognizing the sequence unique to one or more slow 
growing mycobacteria, and 

(4) subjecting the mixture obtained in the step (3) to electrophoresis to detect the bacterium in a sample based 
on the electrophoresis pattern. 

21. The method for detecting a slow growing mycobacteria species according to claim 20, wherein the two oligonucle- 
otides to be used as primers are oligonucleotides represented by SEQUENCE NO. 1 and SEQUENCE NO. 3, and 

so the restriction enzymes to be used are Rsa I and Taq i. 

A detection kit for a slow growing mycobacteria species, which comprises an oligonucleotide containing a region of 
DNA coding for DNA gyrase p subunit, a region having different nucleotide sequence among slow growing myco- 
bacteria. 

A detection kit for a slow growing mycobacteria species, which comprises 

a first oligonucleotide which is identical to a first partial sequence in the DNA sequence coding for the DNA 



40 



22. 

55 

23. 
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gyrase p subuntt of stow growing mycobacteria, 

a second oligonucleotide which Is complementary to a second partial sequence in the DNA sequence coding 
for the DNA gyrase P subunit of slow growing mycobacteria, said first partial sequence and said second partial 
sequence being respectively conserved among slow growing mycobacteria, and 
5 one or more restriction enzyme recognizing the sequence unique to one or more slow growing mycobacteria. 

24. A method for identifying a slow growing mycobacteria species, which comprises the following steps (1) to (4): 

(1) synthesizing an oligonucleotide which comprises a sequence corresponding to a region in the DNA gyrase 
10 p subunit wherein the 3'-side nearest neighbor base to said region in the DNA gyrase p subunit Is a unique 

base among the slow glowing mycobacteria, 

(2) preparing a solution which comprises the oligonucleotide synthesized in the step (1), a labeled ddNTR DNA 
polymerase, and a bacteria! DNA in a sample, 

(3) heating the solution prepared in the step (2) under such conditions that reaction between the labeled 
15 ddNTP and the oligonucleotide occurs, 

(4) checking the existence of the labeled oligonucleotide, and 

(5) Identifying a bacterium in a sample based on the existence of the labeled oligonucleotide. 

25. A method for detecting a slow growing mycobacteria species, which comprises the following steps (1) to (4): 

20 

(1) synthesizing an oligonucleotide which comprises a sequence corresponding to a region in the DNA gyrase 
P subunit wherein the 3'-side nearest neighbor base to said region in the DNA gyrase p subunit is a unique 
base among the slow glowing mycobacteria, 

(2) preparing a solution which comprises the oligonucleotide synthesized in the step (1 ), a labeled ddNTP, DNA 
25 polymerase, and a bacterial DNA in a sample, 

(3) heating the solution prepared in the step (2) under such conditions that reaction between the labeled 
ddNTP and the oligonucleotide occurs, 

(4) checking the existence of the labeled oligonucleotide, and 

(5) detecting a bacterium in a sample based on the existence of the labeled oligonucleotide. 
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Fig. 1 



KPM2201 G3CGA6AACACCGGCTACAC6GTCIAGCfiGTGGGTTGCACGGCGTG6GCGT6TC<jGT6GTT 
ATCC25274 GGTGAGAACAGC6GCTACACCGTCAGCGGT6GGCTGCAKGT6TCGGTGTGTCAGTGGTC 
KPM1 403 GGGGAGMCAGTGSCTACACCGTCAGCGGC6GGTTGCAOK3GGTCGGAGTGTCGGTSGTC 
KPII2027 6GC6ASMCA6CGGCTA(^TCA6C(^G66nSCAWGA8T6G6CSTSTCS6TG6TC 
KPM 201 GGC6AGAACAGTGGTTACAACGTCAGTGGTGGTCTGCACS6C6TGGGTGTGTCGQTGGTC 

KPM2403 GGCGAGAACAGTGGC7ACAACG7CAG7GGT667C7GCACGGC67CG6GG7G7CGG7GG7G 

KPM3012 GGCGAGAACAGCGGCTACAACGTCAGCGGC6GTCTGCACGGCGTCGGCGTCTCGGTGGTC 

BovinelO GGCGAGAACAGCGGCTACAACG7CAGCGGCGG7C7GCACGGCG7C66CG7C7C667GG7C 

KPM31 01 GGTGA6AACAGC6GTTACAACGTCA6C6GTGGCCTGCACGGCGTGGGCGTCTCGGTGGTC 

KPM3401 GGCGA6AACAGCGGATACAACGTCAGTGGCGGTTTGCACGGTGTCGGCGTGTC66TG6TC 
ATCCS1769. ■ GGCGATGACAGCGCCTAC6CGGTCTCGGGTGGTCTGC.ACGGCGTGGGCGTGTCGGTGGTC 

T801 7CGGAC6CQTAT6CGATATCT6GTG6TCTfiCACeeCCTr.ftfimTRTrcfiTffCTT 

7901 7CGfiACGCCTATea3ATATCT fiGTfiCTCTfiCAMfi^Cfim'.fiTBTP.fifiTgCTT 

T704 TC6GACGC6TAT6CGATATC TG6TGfiTCTfiCACfi6C6TCgef!RTfiTRfiflTflfiTT 

7021 7CGGACGCG7ATG(^7ATC T6GTfiGTCTfiCAMSf:CTfflftflfiTftTf!gCTflftTT 

KPM3504 7CC6ACGCC7AT60GA7A7CGGG7GGAC7GCACGG767 GG67G7CTCSG7GG7C 

KPM1001 . 7GCGACGCC7A0G06A7ATCGGGCSGGC7GCACGG7G7 6GG7G7C7CGG7G67C 6 0 

Sequehfce.No. 41 • * ** * . ** ** ******* *+ ****** ***** 

Sequence No. 61- j — ; 

Sequence No. 62; ] Sequence No. 1 

KPM2201 MCGCG77G7CGACGCGG77GGAAG7(^CA7(^WCGA(»6€CACGAGTG6TCGCAG 

ATCC25274 AACGC6nGTCGACCC6ACTCGAGG7CGACA7CAAGCGCGA(KG6CACGAG7GGTCCCAG 

KPM1403 AACGCCCTG7CCACC^CC7G6AAG7CMCG7(JMGC676ACGGCTA7GAG7GGnCCAG 

KPM2027 AAW«SCTGT(«ACC<MCC7GG>GG7CACCA7CMGMC»WG6GCACGAG7G6n7CAG 

KPB1201 AACGCGCTG7CCACCC6ACTGGAAG7C6ACA7CAAGCGCGACG6A7ACGAG7G67CGCAG 

KPM2403 AACGCGC7G7CGACCCGGC7C6AGG7CGACA7CAAGCGTGACGGCCACAAG7GG7CGCAG 

KPM301 2 AACGCGCTG7CCACTCGGC7CGAGG7CAACA7C6CCCGCGACGGC7ACGAG7GG7CGCAG 

BovinelO AACGMCTG7CCAC7CGGC7<^G67CAACA7CG{mC6AC6GC7ACGAG7GG7CGCAG 

KPH31 01 AAC6CGCTG7C6ACCCG6C7CGAGG76GACATCGCCCGCGA7G6C7ACGAA7GGTCGCAG 

KPM3401 AACGCGTrG7CGACCCGGCTCGAGG7GGA7G7CGCCCGCGAC6GC7ACA7G7GG7CACAG 

A70C51 789 AACGCA77G7CGAC7CGAC7CGAGGTGGAGATCGCGACCGAC66G7ACGAG7GG77TCAG 

T801 AACGCGCTATCCACCCGGC7CGAAG7CG AGATCAAGCGCGACGGG7AC GA6TGGTCTCAG 

7901 AACGCGC7A7CCACCCGGC7CGAAG7C GAGA7CAA6CGCGAC6GG7A7 GAG7GG7CTCAG 

7704 AACGC6G7A7roACCC6GC7CGAA67C GAGA7CAA6CGCGACGSGTACG A676G7C7CAG 

7021 AACGCGC7A7CCACCCG6C7CGAA67C 6AGA7CAAGCGCGAC6GG7AC6 AG7GG7C7CAG 

KPM3504 AWGMC7G7CCA7CCGGC76GAGG7GGAGA7CAAGCGCGACGGCCA7GAG7GG7CGCAA 

KPM1001 AACGCAC7G7CCACCCGGC7GGAGG7GGAGA7CAAGC6CGACGGCCA7GAG7GG7CGCAG 120 
***** * ** * ** ***** ** ** * **** ** 
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Fig. 2 
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TAnACAAGC6CGC6&TGCC6G6CACCCTCAAGCA6GGT6A6ACGACCCGCAAGACCGSC 
TATTACGAQCGC6CC6TTCCTGGCACSCTCAAGCAGGGCGAGGCSACCAASAAGACC6SC 
TACTACGACCGGGCGGTGCCCG3CACCCTCMGCAAGGCSA66CGACCAAGAAGACCGGC 
TACTACGACCGCGCCinGCCCGGMCCCTCAA6CAG6GCGA6GCCACCAAGAAGACC8GA 
nCTACGACCGMCCCAGCCGGGCAKCTCAAACAGGGCGAGGCAACCAAGMGAOiG^ 
TTCTACAACMGGCC6TGCC6GGCAG6CrrCAMCA5GGTGMGCCACTMGAAAACC6GA 
TACTACGAMAC^CCCTGCCCGGCACCCTCAAGCAGGGCSAGGCCACCAAGCSCACCeGC 
TACTA(^CCACGCCfiTGCC^G6CA(XCTCMGCAGG(H;6A6^CGACCAAGCGCACCGGC 
nCTAMACCACGCC^TACCCGGAACGCTCAAACAGGGTGAGGCCACCAABCGGACGGGC 
TTCTAC^TCACGC<^GCCGGGAACCCTCAAACAG6GCGAGGCCACCAAGACGACGGGA 
CAnACGACCGCTCT6TC<XX^GCACGCTCAAGCAA6GCGA6AAAACCAAAAA6ACCGGC 
GTTTATGAGAAGTCGGAACCCCTGG6CCTCAAGCAAGGGGCGCCGACCA AGAAGACGGGG 
GTTTATGAGAAGTCgGMCCCCTGGGCCTCAAGCAAGGGGCGGCGACCA AGAAGAQjGGG 
GnTATGAGMGT(^GAACCCCT6G6CCTCAA6CAA6€GGC1SCC6ACCA A6AAGAC66GG 
GTTTATGAGMGTCGGMCCCCTGGGCCTCAAGCAAGGGGCGCCGACCA AGAAGACGGGG 
GTTTATGAGMGTCCGAGCCGATGGGACTCAAGCAAGGCGC^CCGACGAAGMGAGCGGC 
6mACGAGAAATCCGAGCC6AT66<3ACTCAA6CAAGGCGCGCCGACTAAGAAGAC^GGC 180 
** * * ** ***** ***** ** *♦ ** 

Sequences No. 45. 49 

ACCACAATCCGGTTCTGGGCGGATCCGGAGATCTTCGAGACCACCCAATACGACTTCGAG 
ACCAW5ATCCGGnCTGGGCGGACCCGGACATCrrC6AGACCACCCAGTACGACTTCGA6 
ACCACGATMGGTTCTGGGCCGATCCTGAGATCnCGAAACCACCCAGTACGACnCGAG 
ArcACGATCAGGTTCTGG6CGGACCCWAAATCnCGAAACCACACAGTACGACnCGAG 
ACCACCATCCGGTTCTGGGCCGACTCGGACATCTTTGAGACCACCGAATACGACTTCGAG 
AC^ACAAnAGGnCTGGGCCGACCCGGACATCTTCGAGACCACCSAATACGACTTCGAG 
ACCACCATCCG6TTCTGGGCCGACCCCGACATCTTCGAGACCACCGA6TACGACTTCGAA 
ACCACCATCCGGTTCTGGGCCGACCCCGACATCTTCGAGACCACCGAGTACGACTTCGAA 
ACCACGATCAGGnCTGGGCCGAC«:CGACATCTTCGAGACCACCGAGTAC6ACnCGAG 
ACCACCATCA6GTTCTGGGCCGATCCCGACATCTTCGA6ACCACCQAGTACGACTTCGAG 
ACCACtjGTCCGCnCTGGGCCGACCCGGACATCTTCGAGACGACGGATTACGACTTCGAG 
TCAACGGTG CGGnCTGGGCCGACCCCGCT6TTTTCGAAACCACGGAATACGACTTCGAA 
TCMC1SGTG C6GnCTGG6CCGACCCCGCTGTTTTCGAAACCACGGAATACGACTTCGAA 
TCAACGGTA CfiGnCTGGGCCGACCCCGCTGTTTTCGAAACCACGGAATACGACTTCGAA 
TCAACGGT6 CGGTTCTGGGCCGACCCCGCT6TTTTCGAAACCACG6AATACGACTTCGAA 
AI^CGGTGCGGnCTGGGCCGACCCCAACGTTTnGAMCCACCGAGTACGACTTCGAA 
ACGACGGTGCGGTTCTGGGCCGATCCCAATGT7TTTGAGACCACCGAGTACGACTTCGAA 240 



* ** *„ * ******** ** * 
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Fig. 3 

KPU2201 ACG5TGGMCGCCGQCTGCAGGAGAT6GC6TTCCTGAACAAGG6TCTGACGATCAATCT6 

ATCC25274 ACGGTGGCGCGCCGGCTrcAAGAGATGSCGTTCCTGMCMGGQCTTGACCATCAACTTS 

KPW403 ACGGTGGCGCGCCGGnGCAGGAAATGGCGTTCCTCMCAAGGGCCTGACCATCAACCTC 

KPH2027 ACC6T6GCGCGGCGGCTGCA(^(3ATGGCCnCCTCAACAAGGGCCTCACCATCAACCTC 

KPM1201 ACGGTGSCGCGGCGCCTGCAGGAGATGGCGTTCCTCAACAAGGGCCTGACCATCAACCTC 

KPH2403 ACX^TGGCACGCCGGCTGCAGGAAATGGCATTCCTGAACAAGGGCTTGACCATCAACCTC 

KPH3012 ACGGTGGCCCGGCGGCTGCAGGAAATGGC6TTCCTCAACAAGGGCCTGACCATCAACCTC 

Bovine 10 ACGGTGGCGCGGCGGCTGCASGAMTGGCGTTCCTCAACAAGGGCCTGACCATCAACXJTC 

KPH3101 AC6GTGGCGCGCCGGCTGCASGAAATGGCGTTCGTCAACAAGGGGTTGACCATCAACCTC 
KPM3401 . ACGGTGGCGCGCCGACTGCA6GAAAT6GCGTTCCTGAACAAGGGTTTGACGATCAACCTG 

ATCC51789 ACGGTCGCACGCGGGCTGCAG6AAAT66CGTTCCTCAACAAAG66CT6ACCATGAACCTG 

T801 ACCGTCGCCCGCCGGCTGCAA6AGATGGC6TTCCTCAACAASGG6CT6ACCATCAACCTG 

T901 ACCGTC6CCCGCCGGCTGCAAGAGAT6GCGTTCCTCAACAAGGG6CTGACCATCAACCTG 

T704 ACCGTCGCCCGCCGGCTGCAA6AGATGGCGTTCCTCAACAAGGGGCTGACCATCAACCTG 

T021 ACCGTCGCCCGC^GCTGCAAGAGAT6GCGTTCCTCAACAAGGGGCTGACCATCAACCT6. 

KPM3504 ACCGTCGCGCGAC6GTTGCAGGAGATGGCGTTTCTCAACAAGGGGCTCACCATCAACCTG 

KPM1001 ACCGTCGCAC6ACGGTTGCAGGAGATG6CGTTTCTCAACAAGGGGCTCACCATCAATCTG 300 
********** ***** ***** ** ** 1 1 h i ***** ***** * 

KPK2201 ACK^C6AA0GC6TC6AGCAG6A(^G6nGTCGACGAfiGTCGTCAGC8ACACCGCCGAA 

ATCC25274 ACGGACGAGC66ST6GACCAGGAC6AGSTCGTCGATGAAGTC6TCASCGACACCGCCGAT 

KPH1 403 ACCGACGAACGT6TCGAGCAGGAC6A6GTGGTCGAT6AG6TG6TTAGCGACACCGCCGAG 

KPM2027 ACCGACGAACGAGTGGAGCAGGACGAGGTCGTCGACGAGGTCGTCAGCGACACCGCCGAG 

KPM1201 AWJGACGAGCGG6TCACCCC6GACGAGGTCGTCGACGACGTCGTCAGTGATACC6CCGAA 

KPM2403 ACCGACGAGCGA6TTGCCCAGGACGAGGTTGTCGAC6AGGTCGTCAGCGACACCGCCGAG 

KPB3012 ACCGACGAGCGGGTGACCAACGAAGAGGTCGTC6AC6AGGTGGTCAGCGACACCGCCGAC 

Bovi ne1 0 ACCGACGAGCGGGTGACCAAC6AAGA6GTCGTCGAC6AGGT6GTCA6CGACACCGCCGAC 

KPH3101 ACGGACGAGC6G6TSAGCAACGAG6AGGTCGTCGACGAGGTCGTCAGCGATACCGCCGAC 

KPM3401 ACCGACGA6C6GGTCAGTGAAGAGGAGGTCGTC6AC6AT6TCGTCAGCGACACCGCCGAG 

ATCC51 789 ACCGACGAGCGGGTGCGAAACGAAGAAGTCGTCGAGGAGGTCGTCAGCGACACCGCCGAC 

T801 ACCGACGAGAGGGTGACCCAAGACGAGGTCGTCGACGAAGTGGTCAGCGACGTCGCCGAG 

T901 ACCGACGAGAGGGTGACCCAAGACGAGGTCGTCGAC6AAGTGGTCAGCGAC6TCGCCGAG 

T704 ACCGACGAGAGGGTGACCCAAGACGAGGTCGTCGACGAAGTGGTCAGCGACGTCGCCGAG 

T021 accgacgagagggtgacccaagacgaggtcgtcgacgaagtggtcagcgacgtcgccgag 

KPU3S04 ACCGATCAGCGGGTAACCCAGGA(^GTGGT(^CGAGGTGGTCAG{»ACGT(^CCGAG 

KPM1001 ACCGATCAGCGGGTGACCCAGGACGAGGTCGTCGACGAGGTGGTCAGCGACGTCGCCGAG 360 
***** * * ** ****** ***** ********** ****** 
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Fig. 4 

KPM2201 K6C<EAAATCraCC6MGAGAAGGCTGCC^ 

ATCC25274 GCGCCCMGTCCGCCGAASAGAAGGCGGCCGAATCCAAAGCGCCGCACAAGGTTAAGCAC 

KPM1403 GCGCCGMGICAGCCGAGGAGCAGGCMSGCCGMTCGGCCAAGCCGCACAAGGTCAAGCAC 

KPH2027 GCACCGAAGTCC6CMAAGAGMGGCCGCGGAATCGACTGCGCCACACAAGGTCAAGCAC 

KPH1201 GCACCAAAGTCMCCCAGGAGMGGCCGCWAATCGACCGCGCCGCACAAGGTCAAGAGC ' 

KPU2403 6CACCCAA6TCCGCCGMGAAMGGreGCCGMTCi;AAAGGGCCGCATAAGGTTAAGCAC 

KPM3012 GCACCCMGTCGGC6WG6AGM6GCGGCGGAATCGGCTGCGCCGCATAAGGTCAAGCAC 

Bov t net 0 GCACCCAAGTC6GC6CAGGAG AAGGCGGCGGAATCGGCTGCGCCGCATAAGGTCAAGCAC 

KPM3101 6(^CCCAAGTCGGCCCA6GAAAAGGCGGCGGAATCGACTGCGCCACATAAGGTTAAGCAC 

KPU3401 GCACCCAAGTCMCCGTA6AAAAAGCGGCCGAATCGACTGGCCCACACAAGGTTAAGCAC 

' ATCC51 789 GCGCCGAAGTCGGCGCGCGAAGAGGCCGAAGAACGGACCA — CGCAGAAAGTCAAGCAG 

T801 GCGCCGAAGTCGGCAAGTGMCGCGCAGMGAATCCAGTGCACCIKACAAAGTTAAGAGC 

T901 GCGCCGAAGTCGGCAAGTGAACGCGCAGCCGAATCCACTGCACCGCACAAAGTTAAGAGC 

T704 GCGCCGAAGTC36GCAA6TGAAC6CGCAGCCGAATCCACTGCACCGCACAAAGTTAAGA6C 

T021 GCGCCGAA6TCGGCAAGT6AACGCGCAGCCGAATCCACTGCACCGCACAAAGTTAAGAGC 

KPD3504 GCCCC6AAGTCGGCCAGT6A6AAGGCGGCCGAATTCACCGCCCCCCACAAG6TGAAGAAG 

KPM1001 6CC(^AAAGTCGGCCAGXaA6AAGGCGGCC1jAA TCC6CC6CCCCGCACAAG6TC AA6AAG 420 

********** ** ** * *** ^_ ++±j£l3± *** 

Sequences No. 3, S 

KPM2201 CGCACCTTCCACTATCCC6GTGGTCTGGTCGACTTCGTCAAACACATCAACCGCACGAAA 

ATCC25274 CGCACCnCCACTACCCCGQCGGCTTGGTCGACTTCGTCAAGCACATCAACCGGACCAAG 

KPM1403 CGCACGTTCCACT ACCCG6GTGGGTTGGTGGATTTCGTCAAGCACATCAATCGCACCAAA 

KPH2027 CGCACCTTCCACTACWXJGGCGGTCTGGTCGACTTCGTCAAGCACATCAACCGCACCAAG 

KPM1 201 C6CACCTTCCACTATCCCGGCGGTTTG6TCGATTTCGTCAAGCACATCAACCGCACCAAG 

KPH2403 CGCACTTTCCATTAM(X»GG«GCTGATMACTTC6TCAAGCACATCAACCGGACCAAG 

KPM301 2 CCCACCnCCACTACCCCGGCGGCCTGGTCGACTTCGTCAAACACATCAATCGCACCAAA 

KPK31 01 CGCACCHCCACT ACCCCGGCGGTCT6GTCGACTTCGTCAAGCACATCAACCGCACCAAG 

KPM3401 CGCACGTTCCACTACCCGGGCGGCTTGGTGGACTTC6TCAAGCACATCAATCGGACCAAG 

ATCC51789 CGCAC6nCWTTACCCCGGCGGCTTGGTC6ATTTCGTCAAACACATCAACCGCACAAA6 

T801 CGCACCTnCACTATCCGGGTGGOCTGGTGGACnCGTGAAACACATCAACCGCACCAAG 

T901 CGCACCTTTCACTATCCGGGTGGCCTGGTGGACnCGTGAAACACATCAACCGCACCAAG 

T704 C^CACCTnWCTATCCGGGTGGCCTGGTGGACTTCGTGAMCACATCAAMGCACCAAG 

T021 CGCACCTTTCACTATCCGGGTGGCCTGGTGGACTTCGTGAAACACATCAACC6CACCAAG 

KPM3504 CGTACCTTTCACTATCCCGGTGGCTTGGTTGACTTCGTCAAGCACATCAACC6CACCAAG 

KPM1 001 CGTACCnCCACTATCCCGGGGGTCTGGnGACTTCGTCAAGCACATCAACCGGACCAAG 480 

Mi *ak *=* 4cfc 4ok jfc* *uk * atafc. a ja fca fa tefc *afc jfc*4a|at*** M Hak 
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Fig. 5 

KPH2201 AGCWGATCCASCASAGCGTCATC6ACTTIKAAGGCAAAGGCACC6GCCACGAGGTCGAA 

ATCC25274 AGCCCGATCCAACAGAGCGTCATCGACTTCQAGGffiAAAGGCACCGGCCACGAGGTCGAG 

KPH1403 AACCCGATCCAGCAGAGC(nCATCGAGTTCGACfiGCAM6QAACCG(jGCACGAAGTCGAG 

KPM2027 AGtXCGATCW(5CASAGC^WTCGAmC6AC6GCAAGGGCACCGGCCACWGG 

KPM1201 AGTWXjATTCAGCAGAGCATCGTCGACTTCGAGGa^GGGCTCCGGCCACGAAGTCGAA 

KPM2403 AGC(X2GATCCAGCASA6T&TCGTCG(XrrrC6ACGGCAAG6GT6AA68GCACGAGGTCGAG 

KPM3012 AACC^ATCCJVCCAGAGCATCATCGATnCGGTGGGMGGGCCCCGQCCACGAGGTCGAG 

BovinelO AACCCCAT(XACCAGAGCATCArCGATnCGGTGKAASGGCCCCGGCCAC6AGGTCGAG 
KPM3101 ' AG(^GATCCAGCAGAGXJATCAT(^ACnCt^C6GCAAAGGrCCCGGCCACGAGGTCGAG 

KPB3401 MCCCGAnwCMCAGCATCGTGGAmCTCCSGCAAGGWCCG6GCCAC6AGGTCGAA 

ATCC51789 MWX3CATCCAnCGAGCATCGTCGACTTCTCCGGCAAGGGTCCCGGCCACGAGGTCGAG 

T801 AAC6CGAnCATAGCA6CATCGTG8ACTnTCC6GCAA66SMCCGGGCAC6AG6TGGAG 

T901 AAC6WWTTCATAGCAGCATCGTGGACTTTTCCGGCAAGGGCACCGGGCACGAGGTGQAG 

T704 AACGMAnWTAGCAGCATCGTGGACTTTTCCGGCAAGGGCAXGGGCACGAGGTGSAG 

T021 AACGCGATTCATAGCAGCATCGTGGACTTTTCC6GCAAGGGCAIXJGG6CACGAGGTGGAG 

KPM3504 AACSCCATCCACA6CASCATCGTCGACnrTCTCCGGAAAG6GGACCGGCCAC6AAGTGGAQ 

KPM1001 AACGCCATCCACAGCAGCATCGTC6ACTTCTCCGGTAAGGGACCCGGCCACGAAGT6GAG 540 

* ***** ** ** * * ** ****** ** ***** ** ** 

KPH2201 ATC6C^T6<aGTG6MCGGCG6CTACTCCGMT^ 

ATCC25274 ATCGCGAT6CAGTGGAAC6GTGGCTACTCGGAGTCGGTGCAGACCTTCGCCAACACGATC 

KPM1403 ATCGCGAT6CA6TG6AACGGTGGTTATTCG6AGTC6GTGCACACCTTCGCCAACACCATC 

KPH2027 ATC6CCAT6CA6TGGAAC(WCGaTACTt^GTCC6TCCACACCnCGCCAACACCATC 

KPM1 201 ATCGCGATGCA6TGGAACGGCGGCTACTCGGA6TCGGTGCACACCTTCGCCAACACCATC 

KPM2403 ATCGCGATGCAGT6GMCGGCGGCTATTCGGAGT(XGTGCAWCCTTC6CCAACACCATC . 

KPH3012 ATC6CGATGCAGTGGAACGGC6GCTACTCCGAATCG6T6CACACCTTCGCCAACACCATC 

BovinelO ATC6CGAT6CAGTGGAACGGCGGCTACTCCGAATCGGTGCACACCTTCGCCAACACCATC 

KPH31 01 ATCGCGATGCAGTG6MCGGCG6CTACTCG6AATCCGTGCACACCTTCGCCAACACCATC 

KPB3401 ATCGCGATGCAGTG6AATGCDGGCTACTCGGAGTCGGTGCACACCTTCGCCAACACCATC 

ATCC51 789 ATCGCAAT6CAGTGGAACGCC66CTATTCGGAGTCGGTGCACACCTTCGCCAACACCATC 

T801 ATCGCGATGCAAT6GAACGCCG6GTATTC6GAGTC6GTGCACACCTTCGCCAACACCATC 

T901 ATCGCGAT6CAATGGAACGCCGGGTATTC6GAGTCGGTGCACACCTTCGCCAACACCATC 

T704 ATCGCGATGCAATGGAACGCCGGGTATTCGGAGTCGGTGCACACCTTC6CCAACACCATC 

T021 ATCGCGATGCAATGGAACGCCGGGTATTCG6AGTCGGTGCACACCTTCGCCAACACCATC 

KPJI3504 ATCGCGATGCAGTGGAATGCCG6CTATTCflGAGTCGGT6CACACCTTCGCCAACACCATC 

KPH1001 ATCGCGATGCAGTGGAATGCCGGCTATTCGGAGTCGGTGCATACCTTCGCCAACACCATC 600 

I " I I ***** ***** * ************** hi | | m < i|, M Mb) i *** 
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Fig. 6 

KPM2201 MCACCCACGAGGGCGSCACCCACGAAGAGGGCnCCGCAGTGCGCTGACCTC6GTGSTC 

ATCC25274 AACACCCACGAGGGCGSTACGCACGMGAAGGSTTCCGCAGTGCGCTGACGTCGGTGSTG ■ 

KPM1403 AACACCCATGAGGGCG6CACCCACGAGGAGGGCTTCCGCAGCGCGCTGACCTCGGTG8TG 

KPU2027 AACACGCACGAGGGCG6CACCCACGAGGAGGGCTTCCGCAGCGCGCTGACGTCG6TGSTG 

KPM1201 MCACCCATGAGGGTG6AACGCACGAA6AGGGCTTCCGCAGTGCGTTGACCTCGGTGGTG 

KPH2403 AACACCCACGAGGGCGGCACCCACGAAGAAGG6TTCCGCAGCGCACTGACATCGGTGGTG 

KPU301 2 AACACGCACGAGGGCGGCACCCACGAGGAGGGCTTCCGCAGCGCGCTGACCTCCGTGGTC 

Bovinel 0 AACACGCAC6AGGGC6GCACCCACGAGGA66GCTTCC6CAGCGCGCTGACCTCCGTGGTC 

KPH31 01 AACACCCACGAGGGCGGCACCCACGAAGAGGGCTTCCGCAGCGCGCTGACGTCG6TG6TG 

KPM3401 AACACCCACGAGGGCGGCACCCACGAAGAGGGCTTCCGCAGCGCGTTGACGTCGGTGGTC 

ATCCS 1 789 AACACCCAC6AGGGC6GCACCCACGAAGAAGGGTTCCGCGCGGCACTGACGTCCGTGGTG 

T801 MCACCCACGAGGGCGGCACCCACGAAGAGGGCTTCCGCAGCGCGCTGACGtCGGTGGTG 

T901 AACACCCACGAGGGC66CACCCACGAAGAGGGCTTCCGCAGCGCGCTGACGTCGGTGGTG 

n04 AACACCCACGA6GGCGGCACCCACQAAGAGGGCTTCCGCAGCGCGCTGACGTCGGTGGT6 

T021 AACACCCACGAGGGCGGCACCCACGAAGAGGGCTTCCGCAGCGCGCTGAC6TCGGTGGTG 

KPM3504 AACACCCATGAGGGCG6GACCCATGAA6AAGGGTTCCGCAGCGCGCTCAC6TCCGTGGTG 

KPHIOOt AACACCCACGAGGGTGGGACCCACGAAGAGGGGTTCCGCAGCGCGCTGACCTCGGTGGTG 660 
***** ** ***** w ****** ******* ***** 

KPM2201 • AACAAGTACGCCAAAGACAAGAAGCT6CTCAAGGAGAAGGACCCGAATCTCACCG6TGAC 

ATCC25274 AACAAATACGCCAAAGACAAGAAGCTGCTGAAAGACAAGGACCCGAACCTCACCGGTGAC 

KPM1 403 AACAAGTACGCCAAAGACAAGAAGCTGCTCAAGGACAAGGATCCCAACCTCACCGGCGAC 

KPM2027 AACAAGTACGCCAAAGACAAGAAACTGCTGAAGGACAAAGATCCCAACCTCACCGGTGAC 

KPM1201 AACAAGTACGCCAAAGACAAGAAGCTGCTCAAGGACAAGGACCCCAACCTCACCGGTGAC 

KPH2403 AACAAGTACGCCAAAGACAAGAAGCTGCTCAAGGAGAAGGACGCCAACCTCACCGGCGAC 

KPM301 2 AACAAGTACGCCAAGGACAA6AAGCTGCTCAAGGACAAGGACCCCAACCTGACCGGCGAC 

Bov inelO AACAA6TACGCCAAGGACAAGAAGCTGCTCAAGGACAAGGACCCCAACCTGACCGGTGAC 

KPM31 01 AACAAGTACGCCAAAGACAAGAAGTTGCTGAAAGACAA6GACCCGAACCTCACCGGCGAC 

KPM3401 AACAAATACGCCAAGGACCGCAAACTCCTGAAGGACAAAGACCCCAACCTCACCGGCGAC 

ATCC5 1 789 AACAAGTACGCCAAGGACCGAAAACTGCTGAAGGACAAGGACCCCAACCTCACCGGCGAC 

T801 AACAAGTACGCCAAGGACCGCAAGCTACTGAAGGACAAGGACCCCAACCTCACCGGTGAC 

T901 AACAAGTACGCCAAGGACCGCAAGCTACTGAAGGACAAGGACCCCAACCTCACCGGTGAC 

T704 AACAAGTACGCCAAGGACCGCAAGCTACTGAAGGACAAGGACCCCAACCTCACCGGTGAC 

T021 AACAAGTACGCCAA6CACCGCAAGCTACTGAAGGACAAG6AC(XJCAACCTCACCGGTGAC 

KPM3S04 AACM€TAC6a:AAGGACCGCAMCTGCTCAAAGACAAGGACCCCAACCTCACC6GCGAC 

KPM1001 AACAAGTACGCCAAG6ACCGCAAACT6CTCAAGGAAAAG6ACCCCAACCTCACCGGCGAC 720 
***** * ** 1 1 1 1 1 *** ** * ** ** ****** ***** ***** *** 
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Fig. 7 . 



KPH2201 GACATCCGKAGGGGTTGGCCGCGGTGATCTCGGT^ 

' ATCC25274 6AWTCCGl^AGGGACTGG(XGCG6T6ATCTCG6TCMG€TCGCCGMC^CCA6TTCGAG 

KPM1403 GACATCCGAGAAGGGCTGGCC(3CSGTGATCrrCCGTGMGGTCG(XGAGCCGCAGTTCGAG 

KPH2027 GACATCCBTGAGGGCrrGGaxraGTCATCTCG^ 

KPH1201 6ACATCCGC<WGG{iGnGGCCGC^CATCTCGGTGCGQGTGGCAGA6CC6WG'rTCGAG 

KPH2403 GACAnCGC^GGQCCTGGCCGCGGTCATCTCGGTGAMGTTGCCGMCCfiCAGTT CGAG 

KPM3012 GACATCCGCGAGGGTTTGGCCGC^GTGATCTCGGTCAAGGT6AGCGAACC6CAGTTCGAG 

Bovine 10 GACATCCGCGAGGGTnGGCCGCGGTGATGTCGGTCAAGGTGAGCGAACCGCAGTTCGAG 

KPM31 01 GACATTCGC^MGSCCTGGCCGCGGTGATCTCGGTCAAGGTCAGCGAACCGCAGTTCGAG 

KPU3401 GACATCOTGGMGGCCTGGCAKGGTCAmCCGTCMGGTCAGCGAACCGCAAnCGAG 

ATCC51 789 GACATTC6TGAG66MTGGCGGCHaGTCATCTCG6TCAAGGTCAGC6AGCCGCAGTTCGAG 

T801 GATATCCGGGMG6CCTGGCCGCT6T6ATCTCGGTGM6GTCAGCGAACC6CA6TTCGAG 

T901 GATATCCGGGAAGGCCTGGCCGCTGTGATCTCGGTGAAGGTCAGCGAACCfiCAGTTCGAG 

T704 GATATCCGGGAAGfiCCTGGCCGCTGTGATCTQjGTGAAGGTCAGCGAACCGCAGTTCGAG 

T021 GATATCCa€GAAGGCCT6GCCGCTGTGATCTC66TGAA©3TCA6C6AACCGCAGTTCGAG 

KPH3504 GACATCCG^AAG6GnGGCCGtX3GT6ATnCGGTCAAA€TCAGCGAACCGCA6TTCGAG 

KPH1001 6ACAT(X!GGGAAG6GnGGC(^CGGTGATTTCGGTCAAGGTCAGCGAGCCGCAGTTC6AG 780 
********** **** ********** ** ****** ***** 

KPM2201 GGTCAGACCAAGACCAAGCTGGGCAACACCGAGGTCAAGTCGTTCGTGCAGAAGGTGTGC 

ATCC25274 GGCCAGACAAAGACCAAGCTGGGCAACACCGAG6TCAAGTCGTTCGTGCAGAAG6TGT6C 

■KPM1403 GGCCAGACTAAGAC6AAACTCGGCAACACCGAGGTCAAGTCGTTTGTCCAGAAAGTCT6T 

KPM2027 GGCCAGACCAA6ACAM6CT6GSCAACACCGAGGTGAAGTCGTTCGTGCAGAAGGTGTGC 

KPU1201 . GSTCAGAC6AAGACCAAGCTGGGCAACACCGA66TCAAGTC6TTT6TCCAGAAGGTTTGT 

KPM2403 GGCCA6ACCAAGACCAAACTGGGTAACACCGAG6TCAA6TCGTTCGTACAGAAGGTCTGC 

KPH3012 GGCCAGACGAAGACCAAACTGGGCAACACCGAGGTGAASTCGTTCGTGCAGAAGGTGTGC 

BovinelO GGCCASACCAAGACCAAACTGGGCAACACCGAGGT6AAGTCGTTCGT6CAGAAGGTGTGC 

KPM31 01 GGTCAGACCAAGACCAAGCTGGGCAACACCGAAGTGAAGTCGTTOjTGCAGAAGGTCTGC 

KPM3401 GGCCAGACCAAAACCAAGCTGGGCAACACCGA6GTCAAGTWTTCGTGCA6MGGTCTGC 

ATCC51 789 GGCCAGACCAAAACCAAACTCGGCAACACCGAAGTCAA6TCGTTTGTGCAGAAGGTCTGC 

T801 G6CCAGACCAAGACCAAGnGGGCAACA(XJGAGGTCAAArCGTTTGTGCAGAAGGTCT6T 

T901 GGCCAGACCAAGACCAAGTTGGGCAACACCGAGGTCAAATCGTTTGTGCAGAAGGTCTGT 

H04 GGCCAGACCAA6ACCAAGTTGGGCAACACCGAG6TCAAATCGTTTGTGCAGAAGGTCTGT 

T021 GGCCAGACCMGACCMGTTGGGCAACACCGAGGTCAAATCGTTTGTGCAGAAGGTCTGT 

KPH3504 GGCCAGACCAAGACGAAACTAGGCAACACCGAGGTGAAGTCGTTCGTGCAGAAGGTGTGC 

KPH1001 GGCCAGACCAAGACGAAACrGGGCAACACCGAGGTGAAGTCGTTCGTGCAGAAGGTGTGC 840 
** ***** ****** * ** ******** ** ** ***** ** ***** ** ** 
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Fig. 8 
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aacgaacagcfcacccactggnc6a66ccaat(x^c6saabctaaaacc6ttgtgaac 
aac6aacagct cacccact 6gncgag6ccaatccgtcggaagccaaaaccgttgt c aac 
aacgaacaactcactcactggncgaggcgmcccgtcxjgaagctaaaaccgttgtaaac 
aacgagcagctcacccactggnc^ggccaactxjatccgaggcqaaaacggtggtgaac 
mcgagcagctcacmactggttcgaggccaatccrrcggaagccaaaaccattgtgaac 
mcgaacagctgacccactggnmaggccaacccgtcgsmgccaaaaccgtcgtgaac 
aaogaacagctcacccactggncgaagccaacc(xxcagacgccaaagtcattgtcaac 
aacgaacagctcacxxactggncgaagccaaccccgcagacgccaaagtcattgtcaac 
mc6mcagctcacccactggncgag6ccaacc(xxjcggacgccaaggtggtggtcaac 
mcgaacagctcacgcactggttc6aagccaacccggcggatgccaaaactgttgtaaac 
aacgmcagctcacccactggttcgaggccmtcccagcgacgccaagaccgtcgtcaac 
a acgmcagctgacck;actgg tttgmgccmccccaccga ctcgaaagtcgttgtgaac 
aacgmcagctgacccactggtttgaagccaaccccaccgactcgaaagtcgttgtgaac 
mtgmcagctgacccactggt ngaa6ccaacc(x)accga ctcsaaagtcgttgtgaac 
a acgaacagctgacccactggm gaa6ccm(xccaccga( ^cgamgtcgngtgaac 
aatgmcagctcacccanggncgaggccaaox^ctgatgctaamccgttgtcaac 
motaacagctcac^attggnmaggwjaaccccgctgacsctaaaaccgttgtcaac 900 

* ** *** 



**■»■* ******** ***** ******** 



***** 



Sequences No. 51, 57 Sequences No. 47, 53 . 

AAAGCGGTGTCGTCCGCCCAGGCGCGGATCGCX^CGCGCAAAGGGCGAGAGCTGGT6CGC 
AAGGCGGTTTCGTCCGCACAGGCCCG6ATCGCGGC6CGGAAGGCCCGAGAGTTGGTGCGG 
AA6GCG6TTTC6TCGGCCCAGGCCCGCATTGCGGCGCGTAAG6CGCGGGAGTTGGTGCGG 
AAAGCGGTGTCGTCGGCTCAGGCGCGCATTGCCGCGCGCAAGGCGCGTGAACTGGTGCGC 
AAGGCGGTATCCTCGGCGCAGGCACGTCTCGCCGC6CGCAAGGCGCGAGAGTT6GTGCGT 
AAGGCGGTCTCGTCGGCACAGGCGCGTATCGCCGCCCGCAAGGCACGAGAGTTGGTGCST 
AAGGCGGTTTCATCAGCGCAGGCGCGCATCGCCGCGCGCAAGGCGCGAGAGTTGGTGCGC 
AAGGCGGTTTCATCAGCGCAGGCGCGCATCGCCGCGCGCAAGGCGCGAGAGTTGGTGCGC 
AAGGCGGTGTC6TCGGCGCAGGCCCGGATCGCCGCGCGCAAGGCGCGAGAGTTGGTGCGT 
AAGGCGGTTTCGTCGGCCCAGGCCC6AATCGCAGCGCGCAAGGCGCGAGAACTGGT6CGC 
AAAGCGGTGTCGTCGGCGCAGGCCCGCATTGCCGCCCGCAAAGCGCGAGAATTGGTGCGC 
AAGGCTGTGTCCTCGGCGCAAGCCCGTATCGCGGCACGTAAGGCAC6AGA6TTGGTGCGG 
AAGGCTGTGTCCTCGGCGCAAGCCCGTATCGCGGCACGTAAGGCACGAGAGTTGGTGCGG 
AAGGCTGTGTCCTCGGCGCAAfiCCtMTATCGCGGCACGTAAGGCACGAGASTTGGTGCGG 
AAGGCTGTGTCCTCGGCGCAAGCCCGTATCGC6GCACGTAAGGCACGAGAGTTGGTGCGG 
AAGGCAGTTTCATCGGCGCAGGCCAGGATTGCGGCCCGCAAGGCGCGCGAGTTGGTGCGC 
AAGGCGGTTTCATCGGCGCAAGCACGCATTGCGGCCCGCAAGGCGCGCGAGTTGGTGCGC 960 
**************** * *************** ******* 
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Fig. 9 

KPM2201 CGCAAGAGCGCAACCGACCTC^ 

ATCC25274 CG(^AGAGCGCGACCGATTT6GSC6G<iCTGCCCGGCAAGCTGfiCCGACTGCCGTTCCACC 

KPM1403 CGTAAGA6TGCTACGGATTTGGGTGGGRG{X^GCMGnGGCTGAnGCCGCTCXaACG 

KPM2027 ttCAAGAGCGCCAOSACCTCGGtt^ 

KPM1201 C«CAAGAGCGCMCC6ATCTCGStGGGCT6CCCGGCMGnGGCCGACTGCCGCTCGACA 

KPM2403 WCAAGAGCGCTACCGATCTCGGTGGGCTGCCCGGCAAGCTGGCCGACTGCCGCTCCACC 

KPM3012 CGCMGAGC6CAACCGACCTGGGCGGGCT6CC<^GCAAGCTCGCC6ACT6CCGGTCGACC 

Bov i ne1 0 CGCMGAGC6CAA(XGACCTGGGCGGGCTGCCCGGCAA6CT0GCCGACTGCCGGTCGACC 

KPM3101 CGCAAGAGCGCCACC^TCTGGGCGGGCTGCMGGt^GCTCGCCGACTGCCGCTCGACG 

KPM3401 CflCMGAGCGC^ACW!ACCTCGGTGGGCT6CCGGGTAAGCTCGCAGACTGCCGCTCCACC 

ATCC51 789 MCMGAGCGCAAaXaATCnGGMGGCTGCCGGGCAAGCTGGCTGACTGCCGCTCGACC 

T801 CGTAAGAGCGCCACWCATWGTGGAnGCCCCGCAAGCTGGCCGATTGCK^TTCCACG 

T901 CGTMGA6(^CA0CGACATCGGTGGAnGC(^GCAAGCTGGCCGAnGCC€7TCCAC6 

T704 CfiTAA6AGCGCCACC6ACATCGGTGGAn6C(XJG6CMGCT6GCR3GATTGCCGTTCCACG 

T021 CGTMGAGCGCCAC^CATCGGTGGATTGWXMGCAAGCTGGCOTAnGCCGTTCCACG 

KPM3504 CGCAAGAGCGCAACCGATCTGG GCGGACTACCGGG CAAGTTG6CCGACTGCCGCTCGA CC 

KPU1001 CGCMGAGCGCAACCGATCTGGGCGGACTACCCGGCAAGCTCGCCGACTGCCGCTCGACC 1 020 
** ***** ****** ***** * ** ** *** ***** ***** ** ** 

KPK2201 GATCCCCGCAAATCCGAACTGTATGTGGTGGAGGGGGACTCCGCCGGCGGCTCGGCCAAG 

ATCC25274 GACCCGCGCAAGTCC6AACTGTATGTGGT6GAGGGTGACTCGGCAGGTGGCTCGGCCAAG 

KPM1403 GATCCGCGGAAGTCTGAGCTGTATGTGGTGGAAGGTGATTCCGCGGGTGGGTCGGCGAAA 

KPM2027 GACCC6CGGAAATCGGAACTGTAT6TGGT6GAGGGCGATTCGGCC66CGGCTC6GCCAAG 

KPM1 201 6ATCC6CGTAAGTCGGAACTGTATGTGGTGGAG6GTGACTC€6CCGGCGGCTCGGCAAAG 

KPM2403 GATCCGCGCAAGTCGGAATT6TATGTGGTGGAAG6GGACTCGGCCGGCGGCTCCGCCAAG 

KPM301 2 GACCCGCGCAAGTCGGAATTGTATGTGGTCGAGGGTGACTCGGCCGGCGGCTCGGCGAAA 

BovinelO GACCCGC6CAAGTC5GAATTGTATGTGGTCGA666TGACTCGGCGGGC66CTC6GCGAAA 

KPM31 01 GATCCGCGCAAGTCGGAACTGTATGTGGT6GAGGGTGATTCGGCCGGCGGCTCGGCGAAG 

KPM3401 GACCCGCGAAAGTCGGAACTGTATGTGGTGGAGGGTGACTCGGCCGGCGGCTCGGCCAA6 

ATCC51 789 GATCCACGCAAGTCCGAATTGTATGTGGTGGAGGGTGATTCG6CCGGCGGCTCGGCCAAG 

T801 GATCCG CGCAAGTCCGAACTGTATGTC GTAGAAGGT6ACTCGGCCGGCGGTTCTGCAAAA 

T901 GATCC GCGCAAGTCCGAACT6TATGTC GTAGAAGGT6ACTCGGCCGGCGGTTCT6CAAAA 

T704 6ATCC 6CGCMGTCCfiAACTGTAT6TC 6TAGAAfiGTGACTCGG(XGGCGGTTCTGCAAAA 

T021 6ATCC GCGCAAGTCC6AACTGTATGT CGTAGAAGGT6ACTCGGCCGGCGGTTCTGCAAAA 

KPM3504 GACCCCCGTAAGTCCGAATTATATGTGGTGGA6GGTGATTCAGCCGGCG6CTCGSCGAAG 

KPM1 001 GACCCGCGCAAGTCCGAACTGTATGTGGTGGAGGGTGATTCAGCCGGCGGCTCGGCGAAG 1080 
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Fig. 10 



KPM2201 AGC^TC^GAtTC6ATGnCWGGCSAncncC6nGCaC6QCAA6ATCATCAACaTC 

ATCC25274 A6CG6CC6TQACT(^TGTrCC^ 

KPM1403 AGT6GGMTGAnC<^TGTTCCAGGCGATCTTGCC6CTGCGCGGCAAGATCATCAACGTC 

KPM2027 AGCGGGCGCGACTCGATGTTCCAGGCIiATCCT^ 

KPM1201 AGTGGCG6CGATTCGATGTTCCAGGCGATCCTGCCGCTGCGCGGCAAGATCATCAATGTC 

KPM24Q3 AGCGGCC6C6ACTCGATGnTCAGG<MATACnCCGnGC8CGSCAAfiATCATCAACGTC 

KPM3012 AGCGGCCGGGACTCGATGTTCCAGGCCATCCTTCCGCTGCGCG6CAAGATCATCAACGTC . 

Bov inel 0 AGC6GCCG66ACTCGATGTTCCAGGCCATCCTTCC6CTGC8CGGCAA6ATCATCAACQTC 

KPM3101 AGCGGCC6CGACTCGATGTTCCAGGCCATCCTGCCGCTGCGCGGCAAGATCATCAACGTC 

KPM3401 AGCGGCCGCGACTCGATGTTCCAGGCGATCCTCCCGCTGCGrGGCAAGAtCATCAACGTC 

ATCC51 789 AGCGGCCGC6ACTCGATGTTTCAGGC6ATCCTGCCGTTGCGGGGCAAGATCATCAACGTG 

T801 AG(^TCGCGATTCGAT6nCCAGGCGATACnCCGCTGCG0G6CAAGATCATCAAT6TG 

T901 AGCG6T0GCGA7TCGATGTTCCAGGCGATACTTCCGCTGiDGC66CAAGATCATCAATGT6 

T704 AGCGSTCGCGATTCWTGnCCAGGCGATACnCCGCTGCGMGCAAGATCATCAATGTG 

T021 AGGGGTC6CGAnCGAT6TTMAGG(XiATACTTCC6CTGCGCGGCAAGATCATCAATGTG 

KPM3504 AGC6GK*C6ACTCGATGTnCAAG(^TCn6CCGTTGCGCGGCMGATCATCMCGTC 

KPH1001 A6{XffiTC6CGACTCGATGTTCCAGGCCATCTTG0C6TTGCGC6GCAAGATCATCAACGTC 1 1 40 

«M* ♦ 3 W D W°i B W* a M» ♦ +++ +-+4 + +++++++ + + + ++++ 

KPM2201 GAGAAG6CCCGCATCGACCG6GTGCTGAA6AACACCGAAGTCCAGGCCATCATCACCGCG 
ATCC25274 GAGAAG6CCCGCATC6ACCGGGTCCTGAAGAACACCGAAGTCCAGGCGATCATCACCGCG 
KPM1403 GAAAAGGCCCGCATCGATCGGGTGCTGAAAAACACCGAAGTCCAGGCCATCATCACCGCG 
KPM2027 6AGAAGGCCCGCATCGACCGGGTGCTGAAGAACACCGMGTTCAGGCGATCATCACCGCG 
KPM1201 GAAAA6GCACGCATCGAGCGA6TCCTGAAAAACACTGAAGTCCAGGCGATCATCACC6CG 
KPM2403 GAGMGGCCCGCATCGACCGGGTGCTGAAGAACACCGAAGTCCAGGCGATCATCACCGCG 
KPM301 2 GAAAAGGCCCGCATCGACCGGGTGGTGAAGAACACCGAGGTGCAGGCGATCATCACCGCG 
Bov i nel 0 GAAAAG6CCCGCATCGACCGGGTGCTGAAGAACAKGAGGTGCAGGCGATCATCACCGCG 
KPM31 01 GAGAAGGCCCGCATCGACCGGGTGTTGAAGAACACCGAGGTGCAGGCCATCATCACCGCC 
KPM3401 GAGAAGGCGCGCATCGACCGGGTGCTGAAGAACACC6AAGTTCAGGCGATCATCACC6CG 
ATCC51 789 GAGAAGGCCCGCATCGACCG6GTGCTGAA6AACACTGAGGT6CAGGCGATCATCACCGCG 
T801 GAGAAAGCGC6CATCGACCGGGTGCTAAAGAACACCGAAGTTCAGSCGATCATCACGGCG • 

T901 GAGAAAGCGCGCATC6ACCGGGTGCTAAAGAACACC6AAGTTCAGGCGATCATCACGSCG 
T704 6AGAAAGCGCGCATCGACCGG6TGCTAAAGAACACCGAA6TTCAGGC6ATCATCACGGC6 
T02I 6AGAAAGCGCGCATCGACCGGGTGCTAAAGAACACCGAAGTTCA66CGATCATCACGGCG 
KPM3504 GAGAAGGCCCGCATCGACC6GGTGCTGAAGAACACCGAA6TCCAGGCGATCATCACGGCG 
KPM1001 6AGAAG6CCCGCATCGACCGGGTGCTGAAGAACACCGAAGTCCAGGCGATCATCACCGCG 1 200 

±sk ** i *ii iin ki i i i ik dak afc sfc* dakskdak rfufc 4ofc J rf pfa fc* f fafcatai t fr ** 
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Fig. 11 

KPM2201 CTGGGCACCGGGATeCACGAOSAGTTC6ACATCACCAAACTSCfiCTACCACAA6ATC6TA 

ATCC25274 CTGGGTACCGGTAn(^CGACWGtTCGACATTTCTAAACTG(X3TTACCACAAGATCGTG 

KPH1403 CTGGGCACCGGCATCCA(^CGAATTCGACATCACCAAACTGCGTTACCACAAGATCGT6 

KPH2027 CTGGGTACKGGATTCACCACGAGnCGACATCACCAAGCTGCGCTATCACAAGATCGTG 

KPH1201 nGGGTACCGGTATTCACGACGAATTCGACCTCTCGAAGCTGCGCTATCACAAGATCGTC 

KPH2403 CTGGGTACCG6AATTCAC6ACGAGTTCGACCTCGCCAAACTGCGCTACCACAAGATCGTG ■ 

KPM3012 CTGGGCACCGGGATTCAC6AC6ASTTCGACATCACCAAGCTflCBCTACCACAAGATCGTG 

BovlnelO CTGGGCACC^GATTCACGACGAGTTCGACATCACCAAGCTGCGCTACCACAAGATCGTG 

KPH31 01 CTGG6CACCGGCATCCAC6ACGAGTTCGACATCACCAAGCTGCGCTATCACAAGATCGTG 

KPM3401 CTGGGCACGGGGATTCACGACGAGTTCGACATCACCAAGCTCCGGTACCACAAGATCGTG 

ATCC51789 CTGGGCACGGGGATTCAC6ACGAGTTC6ACATCTCCAAGCTGCGCTACCACAAGATC6TG 

TS01 CTGGSCACCGGGATCCACfiACGAGTTCGATATCGGCAAGCTGCSCTACCACAAGATCGTQ 

T301 CTGGGCACCGGGATCCACfiACGAGTTCGATATCGGCAAGCTGCGCTACCACAAGATCGTG 

T704 CT6GGCACCGGGATCCAC6ACGAGTTCGATATCGGCAAGCTGCGCTACCACAAGATCGTG 

T021 CTGGGCACCGGGATCCAC6ACGAGTTC6ATATCGGCAAGCTGC6CTACCACAA6ATCGTG 

KPM3504 nGGGCATOGSTATTCACGACGAATTCSACATCGCGAGACTGCGTTACCACAAGATCGTG 

KPM1001 TTGGGTACCGGCATCCACGACGAATTCGACATCGCGAGACTGCGTTACCACAAGATCGTG 1^60 

-. 

AafafcA jfc* ^ nt B te fcA A* • + + ■+ + + jk 3k -Mr U «HH^rt i f ♦ t 
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Fig. 14 



H1234S8789H 
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Sequence Sequence Sequence Sequence 



5&7 9&11 S&13 15&17 
I "11 II II 1 

M12 3 4 123412341234 
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Fig. 16 
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